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(54) Subtilisin variants 

(57) Novel carbonyl hydrolase variants derived from 
the DNA sequences of naturally-occuring or recom- 
binant non-human carbonyl hydrolases are disclosed. 
The variant carboryl hydrolases, in general, are ob- 
tained by in vitro modification of a precursor DNA se- 
quence encoding the naturally-occurring or recom- 
binant carbonyl hydrolase to generate the substitution 
of a plurality of amino acid residues in the amino acid 
sequence of a precursor carbonyl hydrolase. Such var- 
iant carbonyl hydrolases have properties which are dif- 



ferent from those of the precursor hydrolase, such as 
altered proteolytic activity, altered stability, etc. The sub- 
stituted amino acid residues correspond to positions 
+76 in combination with one or more of the following res- 
idues +99, +1 01 , +1 03, +1 04, +1 07, +1 23, +27, +1 05, 
+1 09, +1 26, +1 28, +1 35, +1 56, +1 66, +1 95, +1 97, +204, 
+206, +21 0, +21 6, +21 7, +21 8, +222, +260, +265 and/ 
or +274 in bac'iHus amyioltquefaciens subtilisin, 
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Description 

Cross- Reference to Related Applications 

5 [0001] This application is a continuation-in-part of US Application Serial Number 08/137,240 filed October 14, 1993 
(pending) and which is Incorporated herein by reference in its entirety. 

Field of the Invention 

10 [0002] The present invention relates to novel carbonyi hydrolase variants having an amino acid sequence wherein 
a plurality of amino acid residues of a precursor carbonyi hydrolase, specifically those at positions corresponding or 
equivalent to residue +76 in combination with one or more of the residues selected from the group consisting of +99, 
+101. +103. +104, +107, +123, +27, +105, +109, +126, +128, +135, +156, +166, +195, +197, +204, +206, +210. +216. 
+217, +218, +222, +260, +265 and/or +274 In Baalfus amyiotlquefaciens subtlllsin, have been substituted with a dif- 
ferent amino acid. Such mutant/variant carbonyi hydrolases, In general, are obtained by In vitro modification of a pre- 
cursor DNA sequence encoding a naturally-occurring or recombinant carbonyi hydrolase to encode the substitution of 
a plurality of these amino acid residues in a precursor amino acid sequence alone or in combination with other substi- 
tution, Insertion or deletion in the precursor amino acid sequence. 

20 Background of the Invention 

[0003] Serine proteases are a subgroup of carbonyi hydrolase. They comprise a diverse class of enzymes having a 
wide range of specificities and biological functions. Stroud, R. Sci. Amer., 131 :74-88. Despite their functional diversity, 
the catalytic machinery of serine proteases has been approached by at least two genetically distinct families of en- 
25 zymes: the subtilisins and the mammalian chymotrypsin related and homologous bacterial serine proteases (e.g., 
trypsin and S. greslusXxypsm) These two families of serine proteases show remarkablysimilar mechanisms of catalysis. 
Kraut, J. (1 977), Ann. Rev. Blochem., 46:331-358. Furthermore, although the primary structure is unrelated, the tertiary 
structure of these two enzyme families bring together a conserved catalytic triad of amino acids consisting of serine, 
histldine and aspartate. 

30 [0004] Subtiiisin is a serine endoprotease (iVIW 27,500) which is secreted in large amounts from a wide variety of 
Bacillus species and other microorganisms. The protein sequence of subtiiisin has been determined from at least four 
different species of Baciflus. IVIarkland, F.S., et al. (1983), Honne-Seyler's Z. Physiol. Chem. , 364:1537-1540, The 
three-dimensional crystall ©graphic structure of Bacifius amytoUquefaciens subtiiisin to 2.5A resolution has also been 
reported. Wright, C.S., et al. (1 969), Nature, 221 :235-242; Drenth, J., et al. (1 972), Eur. J. Biochem., 26:177-1 81 . These 

35 studies indicate that although subtiiisin is genetically unrelated to the mammalian serine proteases, it has a similar 
active site structure. The x-ray crystal structures of subtiiisin containing covalently bound peptide inhibitors (Robertus, 
J.D., et al. (1 972), Biochemistry , 1 1 :2439-2449) or product compl9xes (Robertus, J.D., et al. (1976), J. Biol. Chem. , 
251 : 1 097-11 03) have also provided information regarding the active site and putative substrate binding cleft of subtiiisin. 
In addition, a large number of kinetic and chemical modification studies have been reported for subtiiisin (Philipp, iVI., 

40 etal. (1983), Mol. Cell. Blochem., 51:5-32; Svendsen, B. (1976), Carlsberg Res. Comm. , 41:237-291; Markland, F.S. 
Id.) as well as at least one report wherein the side chain of methionine at residue 222 of subtiiisin was converted by 
hydrogen peroxide to methionine-sulfoxide (Stauffer, D.C., et al. (1965), J. Biol. Chem., 244:5333-5338) and the side 
chain of serine at residue 221 converted to cysteine by chemical modification (Polgar, et al. (1981), Biochimica et 
Biophvsica Acta . 667:351 -354.) 

45 [0005] US Patent 4,760,025 (RE 34,606) discloses the modification of subtiiisin amino acid residues corresponding 
to positions in BacHfus amytoUquefaciens subtiiisin tyrosine -1 , aspartate +32, asparagine +155, tyrosine +104, me- 
thionine +222, glycine +1 66, histldine +64, glycine +1 69, phenylalanine +1 89, serine +33, serine +221 , tyrosine +21 7, 
glutamate +156 and alanine +152. US Patent 5,182,204 discloses the modification of the amino acid +224 residue In 
Baciltus amylotiquefacfens subtiiisin and equivalent positions in other subtilisins which may be modified by way of 

50 substitution, insertion or deletion and which may be combined with modifications to the residues identified in US Patent 
4,760,025 (RE 34,606) to form useful subtiiisin mutants or variants. US Patent 5,155,033 discloses similar mutant 
subtilisins having a modification at an equivalent position to +225 of B. amy/oflquefaclens subtiiisin. US Patents 
5,185,258 and 5,204,015 disclose mutant subtilisins having a modification at positions +123 and/or +274. The disclo- 
sure of these patents Is Incorporated herein by reference, as Is the disclosure of US Patent Application SN 07/898,382, 

55 which discloses the modification of many amino acid residues within subtiiisin, including specifically +99, +1 01 , +1 03, 
+1 07, +1 26, +1 28, +1 35, +1 97 and +204. All of these patents/applications are commonly owned. US Patent 4,91 4,031 
discloses certain subtiiisin analogs, Including a subtiiisin modified at position +76. The disclosure of this patent is also 
incorporated herein by reference. The particular residues identified herein and/or the specific combinations claimed 



2 



EP1 526182 A2 



herein, however, are not Identified In these references. 

[0006] Accordingly, It Is an object herein to provide carbonyl hydrolase (preferably subtillsin) variants containing the 
substitution of a plurality of amino acid residues In the DNA encoding a precursor carbonyl hydrolase corresponding 
to positions +76 in combination with one or more positions selected from the group +99, +1 01 , +1 03, +1 04, +1 07, +1 23, 
5 +27,+105,+109,+126, +128,+135,+156,+166,+195,+197, +204,+206,+210,+216,+217,+218,+222,+260,+265 
and/or +274 in Bacilfus amyloliquefaciens subt\\\s\n. Such variants generally have at least one property which is different 
from the same property of the carbonyl hydrolase precursor from which the amino acid sequence of said variant is 
derived. 

[0007] It is a further object to provide DNA sequences encoding such carbonyl hydrolase variants, as well as ex- 
^0 pression vectors containing such variant DNA sequences. 

[0008] Still further, another object of the Invention is to provide host ceils transformed with such vectors, as well as 
host cells which are capable of expressing such DNA to produce carbonyl hydrolase variants either intracellularly or 
extracellularly. 

[0009] The references discussed above are provided soieiy for their disclosure prior to the filing date of the instant 
15 case, and nothing herein is to be construed as an admission that the inventors are not entitled to antedate such dis- 
closure by virtue of a prior invention or priority based on earlier filed applications. 

Summary of the Invention 

20 [0010] The invention includes non-naturally-occurring carbonyl hydrolase variants having a different proteolytic ac- 
tivity, stability, substrate specificity, pH profile and/or performance characteristic as compared to the precursor carbonyl 
hydrolase from which the amino acid sequence of the variant Is derived. The precursor carbonyl hydrolase may be a 
naturally-occurring cartDonyl hydrolase or recombinant hydrolase. Specifically, such carbonyl hydrolase variants have 
an amino acid sequence not found in nature, which is derived by replacement of a plurality of amino acid residues of 

25 a precursor carbonyl hydrolase with different amino acids. The plurality of amino acid residues of the precursor enzyme 
correspond to position +76 in combination with one or more of the foiiowing residues +99, +101 , +1 03, +1 04, +1 07, 
+1 23, +27, +1 05, +1 09, +126, +128, +1 35, +1 56, +1 66, +1 95, +1 97, +204, +206, +21 0, +21 6, +21 7, +218, +222, +260, 
+265 and/or +274, where the numbered position corresponds to naturally-occurring subtillsin from Bacillus amytotiq- 
uefacfens or to equivalent amino acid residues In other carbonyl hydrolases or subtillslns, such as BaciUus ientus 

30 subtillsin. The carbonyl hydrolase variants of the present Invention comprise replacement of amino acid residue +76 
in combination with one or more additional modifications. Preferably the variant enzymes of the present invention 
comprise the substitution, deletion or insertion of amino acid residues in the foiiowing combinations: 76/99; 76/101; 
76/103; 76/104; 76/107; 76/123; 76/99/1 01 ; 76/99/1 03; 76/99/104; 76/101/1 03; 76/101/104; 76/103/104; 76/1 04/107; 
76/104/123; 76/107/123; 76/99/101/103; 76/99/101/104; 76/99/103/104; 76/101/103/104; 76/103/104/123; 

35 76/104/107/123; 76/99/101/103/104; 76/99/103/104/123; 76/99/101/103/104/123; 76/103/104/128; 76/103/104/260; 
76/1 03/1 04/265; 76/1 03/1 04/1 97; 76/1 03/1 04/1 05; 76/1 03/1 04/1 35; 76/1 03/1 04/1 26; 76/1 03/1 04/1 07; 
76/103/104/210; 76/103/104/126/265 and/or 76/103/104/222. Most preferably the variant enzymes of the present in- 
vention comprise the substitution, deletion or insertion of an amino acid residue in the foiiowing combination of residues: 
76/99; 76/104; 76/99/104; 76/103/104; 76/104/107; 76/101/103/104; 76/99/101/103/104 and 76/101/104 of B. amy- 

^0 toliquefacfens subtillsin. 

[0011] The invention also includes variant DNA sequences encoding such carbonyl hydrolase or subtillsin variants. 
These variant DNA sequences are derived from a precursor DNA sequence which encodes a naturally-occurring or 
recombinant precursor enzyme. The variant DNA sequences are derived by modifying the precursor DNA sequence 
to encode the substitution of one or more specific amino acid residues encoded by the precursor DNA sequence 

45 correspondingto positions 76, 99, 101, 103, 104,107,123,27, 105, 109, 126, 128,135, 156,166, 195, 197,204,206, 
210, 21 6, 21 7, 21 B, 222, 260, 265 and/or 274 in BaciUus amyfofi'quefaaens or any combination thereof. Although the 
amino add residues identified for modification herein are identified according to the numbering applicable to B. amy- 
foHquetadens (which has become the conventional method for identifying residue positions in ali subtillslns), the pre- 
ferred precursor DNA sequence useful In the present invention is the DNA sequence of BaciUus Ientus as shown in 

50 Fig. 6 (Seq ID No 11). 

[0012] The variant DNA sequences of the present invention encode the Insertion or substitution of the amino acid 
residue 76 in combination with one or more additional modification. Preferably the variant DNA sequences encode the 

substitution or insertion of amino acid residues in the following combinations: 76/99; 76/101 ; 76/103; 75/104: 76/107; 
76/123; 76/99/101; 76/99/103; 76/99/1 04; 76/101/103; 76/101/104; 76/103/104; 75/104/107; 76/104/123; 76/1 07/123; 
55 76/99/1 01/1 03; 76/99/1 01 /I 04; 76/99/1 03/1 04; 76/1 01/1 03/1 04; 76/1 03/1 04/1 23; 76/1 04/1 07/1 23; 76/99/1 01 /I 03/1 04; 
76/99/103/104/123; 76/99/101/103/104/123; 76/103/104/128; 76/103/104/260; 76/103/104/265; 76/103/104/197; 
76/103/104/105; 76/103/104/135; 76/103/104/126; 76/103/104/107; 76/103/104/210; 76/103/104/126/265 and/or 
76/103/1 04/222. Most preferably the variant DNA sequences encode for the modification of the foiiowing combinations 
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of residues: 76/99; 76/104; 76/99/104; 76/103/104; 76/104/107; 76/101/103/104; 76/99/101/103/104 and 76/101/104. 
These recombinant DNA sequences encode carbonyl hydrolase variants having a novel amino acid sequence and, In 
general, at least one property which is substantially different from the same property of the enzyme encoded by the 
precursor carbonyl hydrolase DNA sequence. Such properties include proteolytic activity, substrate specificity, stability, 

5 altered pH profile and/or enhanced performance characteristics. 

[0013] The present invention encompasses the substitution of any of the nineteen naturally occurring L-amino acids 
at the designated amino acid residue positions. Such substitutions can be made in any precursor subtilisin (procaryotic, 
eucaryotic, mammalian, etc.). Preferably, the substitution to be made at each of tfie Identified amino acid residue 
positions Include but are not limited to: substitutions at position 76 including D, H, E, G, F, K, P and N; substitutions at 

^0 position 99 including D, T, N, G and S; substitutions at position 101 including G, D, K, L, A, E, S and R; substitutions 
at position 1 03 including Q, T E, G, R, S and A; substitutions at position 1 04 including all nineteen naturally- 
occurring amino acids; substitutions at position 1 07 including V, L, M, Y, G, E, F, T, S, A, N and I; substitutions at position 
123 including N, T, I, G, A, C and S; substitutions at position 27 including K, N, C, V and T; substitutions at position 
105 including A, D, G, R and N; substitutions at position 107 including A, L, V, Y, G, F, X S and A; substitutions at 
position 109 including S, K, R, A, N and D; substitutions at position 126 including A, F, I, V and G; substitutions at 
position 12S including G, L and A; substitutions at position 135 including A, F, I, S and V; substitutions at position 156 
including D, E, A, G, Q and K; substitutions at position 166 including all nineteen naturally-occurring amino acids; 
substitutions at position 1 95 including E; substitutions at position 1 97 including E; substitutions at position 204 Including 
A, G, C, S and D; substitutions at position 206 including L, Y. N, D and E; substitutions at position 210 including L, I, 

20 s, C and F; substitutions at position 216 including V, E, T and K; substitutions at position 217 including all nineteen 
naturally-occurring amino acids; substitutions at position 218 including S, A, G, T and V; substitutions at position 222 
including all nineteen naturally-occurring amino acids; substitutions at position 260 Including P, N, G, A, C, K and 
D; substitutions at position 265 including N, G, A, S, G, K, Y and H; and substitutions at position 274 including A and 
S. The specifically preferred amino acid(s} to be substituted at each such position are designated below in Table I. 

25 Although specific amino acids are shown in Table I, it should be understood that any amino acid may be substituted 
at the identified residues. 



Table I 



40 



Amino Add Residue 


Preferred Amino Acid to be Substituted/inserted 


+76 


D,H 


+99 


DT.N.G 


+ 101 


R,G,D,K,LAE 


+103 


A,Q,T,D,E,Y,K,G,R 


+104 


l,Y,S,L,A,T,G,F,M,W,D,V,N 


+ 107 


V,L,Y,G,F,T,SAN 


+ 123 


S.TJ 


+27 


K 


+ 105 


A,D, 


+109 


S,K,R 


+126 


A,I.V,F 


+ 128 


G,L 


+ 135 


IAS 


+156 




+ 166 


D,G,E,K,N,A,FJ,V,L 


+ 195 


E 


+197 


E 


+204 


A,G,C 


+206 


L 


+210 


l,S,G 


+216 


V 


+217 


HJ,Y,CAG:F,S,N,E,K 


+218 


S 


+222 


A,Q,S,G,I,K 


+260 


PA,S,N,G 
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Table I (continued) 



Amino Acid Residue 


Preferred Amino Acid to be Substitutedylnserted 


+265 
+274 


NAG.S 



[0014] Further, the invention includes expression vectors containing such variant carbonyl hydrolase DNA sequenc- 
es, as well as host cells transfornned with such vectors which are capable of producing such variants. The invention 
also relates to detergent compositions comprising the carbonyl hydrolase variants of the invention. 

Brief Description of the Drawings 

[0015] 

Figs. 1 A-C depict the DNA and amino acid sequence for BacUlus amylofiquefaciens suWWsln and a partial restric- 
tion map of this gene (Seq. ID No. 6}. 

Fig. 2 depicts the conserved amino acid residues among subtillsins from Baciflus amytoUquefaciens (BPN)' and 
Bacitlus I e nt us (wi Id -ty p e) . 

Figs. 3A and 3B depict the amino acid sequence of four subtilisins. The top line represents theamino acid sequence 
of subtilisin from Bachlus amyioUquefactens su\^X\\\s\n (also sometimes referred to as subtilisin BPN'} (Seq. ID No. 
7). The second line depicts the amino acid sequence of subtilisin from BacUlus subtiiis (Seq. ID No.8). The third 
line depicts the amino acid sequence of subtilisin from B. Ucheniformis (Seq. ID No. 9). The fourth line depicts the 
amino acid sequence of subtilisin from Baciiius lentus (also referred to as subtilisin 309 in PCT WO89/0B276) (Seq. 
ID No.1 0). The symbol * denotes the absence of specific amino acid residues as compared to subtilisin BPN'. 

Fig. 4 depicts the construction of plasmid GGA274. 

Fig. 5 depicts the construction of GGT274 which is an intermediate to certain expression plasmids used in this 
application. 

Figs. 6A and 68 depict the DNA and amino acid sequence of subtilisin from Baciflus fentus (Seq. ID No.11). The 
mature subtilisin protein is coded by the codons beginning at the codon GCG (334-336) corresponding to Ala. 

Figs. 7A and 7B depict the DNA and amino acid sequence of a preferred embodiment of the invention (N76D/ 
S103AA/104I) (Seq. ID No. 12). The DNA in this figure has been modified by the methods described to encode 
aspartate at position 76, alanine at position 1 03 and isoleucine at position 1 04. The mature subtilisin variant protein 
is coded by the codons beginning at the codon GCG (334-336) corresponding to Ala. 

Fig. 8 depicts the construction of vector pBCDAICAT. 

Fig. 9 depicts the construction of vector pUCCATFNA. 

Fig. 10 shows the stability of a preferred mutant enzyme compared to wild-type, in a liquid detergent fomnuiation. 
Detailed Description of the Invention 

[0016] It has been discovered that in wfro mutations in B. /e;?ft/s subtilisin at an amino acid residue equivalent to +76 
in Baciflus amyfofiquetaciens subWWs'm produces subtilisin variants exhibiting altered stability (e.g., modified autopro- 
teoiytic stability) over precursor subtilisins. (See Tables IV and Vi.) 

[0017] It has also been discovered that in vitro mutation at residues equivalent to +99, +101 , +103, +104, +107, 
+ 123, +27, +105, +109, +126, +128, +135, +156, +166, +195, +197, +204, +206, +21 0, +21 6, +21 7, +218, +222, +260, 
+265 and/or +274 in Bacillus amyloliquefaciens subtilisin, alone or in combination with each other and in any combi- 
nation with +76 mutations, produce subtilisin variants exhibiting altered proteolytic activity, altered themriai stability, 
altered pi-i profile, altered substrate specificity and/or altered performance characteristics. 
[0018] Carbonyl hydrolases are enzymes which hydrolyze compounds containing 
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bonds in which X is oxygen or n itrogen. They include naturaiiy-occurring carbonyi hydrolases and recombinant cariDonyl 
hydrolases. Naturally-occurring carbonyi hydrolases principally include hydrolases, e.g , peptide hydrolases such as 
subtlllslns or metalloproteases. Peptide hydrolases Include ct-aminoacylpeptide hydrolase, peptldylannino acid hydro- 
lase, acylamino hydrolase^ serine carboxypeptldase, metallocarboxypeptidase, thiol proteinase, carboxylproteinase 
^0 and metal loprotelnase. Serine, metallo, thiol and acid proteases are Included, as well as endo and exo-proteases. 
[0019] "Recombinant carbonyi hydrolase" refers to a carbonyi hydrolase in which the DNA sequence encoding the 
naturally-occurring carbonyi hydrolase is modified to produce a mutant DNA sequence which encodes the substitution, 
insertion or deletion of one or more amino acids in the carbonyi hydrolase amino acid sequence. Suitable modification 
methods are disclosed herein, and in US Patent 4,760,025 (RE 34,606), US Patent 5,204,01 5 and US Patent 5,1 85,258, 
the disclosure of which are incorporated herein by reference. 

[0020] Subtillslns are bacterial orfungal carbonyi hydrolases which generally act to cleave peptide bonds of proteins 
or peptides. As used herein, "subtilisin" means a naturally-occurring subtilisin or a recombinant subtilisin. A series of 
naturally-occurring subtillslns is known to be produced and often secreted by various microbial species. Amino acid 
sequences of the members of this series are not entirely homologous. However, the subtilisins in this series exhibit 

20 the same or similar type of proteolytic activity. This class of serine proteases shares a common amino acid sequence 
defining a catalytic triad which distinguishes them from the chymotrypsin related class of serine proteases. The sub- 
tilisins and chymotrypsin related serine proteases both have a catalytic triad comprising aspartate, histidlne and serine. 
In the subtilisin related proteases the relative order of these amino acids, reading from the amino to carboxy terminus, 
is aspartate-histidine-serine. In the chymotrypsin related proteases the relative order, however, is histidine-aspartate- 

25 serine. Thus, subtilisin herein refers to a serine protease having the catalytic triad of subtilisin related proteases. Ex- 
amples include but are not limited to the subtilisins identified in Fig. 3 herein. 

[0021] "Recombinant subtilisin" refers to a subtilisin in which the DNA sequence encoding the subtilisin is modified 
to produce a variant (or mutant) DNA sequence which encodes the substitution, deletion or insertion of one or more 
amino acids In the naturally-occurring subtilisin amino acid sequence. Suitable methods to produce such modification, 
30 and which may be combined with those disclosed herein, Include those disclosed In US Patent 4,760,025 (RE 34,606), 
US Patent 5,204,015 and US Patent 5,185,258. 

[0022] "Non-human carbonyi hydrolases" and the DNA encoding them may be obtained from many procaryotic and 
eucaryotic organisms. Suitable examples of procaryotic organisms Include gram negative organisms such as E. coli 
or Pseudomonas and gram positive bacteria such as Micrococcusor Baalfus. Examples of eucaryotic organisms from 

35 which carbonyi hydrolase and their genes may be obtained include yeast such as Saccharomyces cerevtsiae, fungi 
such as Aspergillus sp. and non-human mammalian sources such as, for example, bovine sp. from which the gene 
encoding the carbonyi hydrolase chymosin can be obtained. As with subtilisins, a series of carbonyi hydrolases can 
be obtained from various related species which have amino acid sequences which are not entirely homologous between 
the members of that series but which nevertheless exhibit the same or similar type of biological activity. Thus, non- 

^0 human carbonyi hydrolase as used herein has a functional definition which refers to carbonyi hydrolases which are 
associated, directly or Indirectly, with procaryotic and eucaryotic sources, 

[0023] A "carbonyi hydrolase variant" has an amino add sequence which is derived from the amino acid sequence 
of a "precursor carbonyi hydrolase." The precursor carbonyi hydrolases (such as a subtilisin) include naturally-occurring 
carbonyi hydrolases (subtilisin) and recombinant carbonyi hydrolases (subtilisin). The amino acid sequence of the 
carbonyi hydrolase variant is "derived" from the precursor hydrolase amino acid sequence by the substitution, deletion 
or insertion of one or more amino acids of the precursor amino acid sequence. Such modification is of the "precursor 
DNA sequence" which encodes the amino acid sequence of the precursor carbonyi hydrolase (subtilisin) rather than 
manipulation of the precursor carbonyi hydrolase (subtilisin) enzyme perse. Suitable methods for such manipulation 
of the precursor DNA sequence include methods disclosed herein, as well as methods l<nown to those skilled in the 

so art (see, for example, BP 0 328299, WOB9/06279 and the US patents and applications already referenced herein). 
[0024] Specific residues corresponding to position +76 in combination with one or more of the following positions 
+99, +101, +103, +104, +107, +123, +27, +105, +109, +126, +128, +135, +156, +166, +195, +197, +204, +206, +210, 
+216, +217, +218, +222, +260, +265 and/or +274 of Bacilfus amyfoffquefaciens subX\\\s\n are identified herein for mu- 
tation. Preferably the modified residues are selected from the following combinations: 76/99; 76/101; 76/103; 76/104; 

55 76/107; 76/123; 76/99/101; 76/99/103; 76/99/104; 76/101/103; 76/101/104; 76/103/104; 76/104/107; 76/104/123; 
76/107/123; 76/99/101/103; 76/99/101/104; 76/99/103/104; 76/101/103/104; 76/103/104/123; 76/104/107/123; 
76/99/101/103/104; 76/99/103/104/123; 76/99/101/103/104/123; 76/103/104/128; 76/103/104/260; 76/103/104/265; 
76/1 03/1 04/1 97; 76/1 03/1 04/1 05; 76/1 03/1 04/1 35; 76/1 03/1 04/1 26; 76/1 03/1 04/1 07; 76/1 03/1 04/21 0; 
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76/103/104/126/255 and/or 76/103/104/222; and most preferably are 76/99; 76/104; 75/99/104; 75/103/104; 
76/104/107; 76/101/103/104; 76/99/101/103/104 and 76/101/104. These amino acid position numbers refer to those 
assigned to the mature BaciUus amyfoUquefadenssubX\\\s\r\ sequence presented In Fig. 1 . The Invention, however, Is 
not limited to the mutation of this particular subtilisin but extends to precursor carbonyl hydrolases containing amino 
5 acid residues at positions which are "equivalent" to the particular identified residues in Baciffus amyfotiquefadens Qub- 
tlllsln. In a preferred embodiment of the present Invention, the precursor subtilisin Is BacUius lentus subtlllsin and the 
substitutions, deletions or insertions are made at the equivalent amino acid residue In B. lentus con^espordlng to those 
listed above. 

[0025] A residue (amino acid) of a precursor carbonyl hydrolase Is equivalent to a residue of Baditus amyfoliquefa- 
^0 dens subtilisin If it Is either homologous (i.e., corresponding In position in either primary or tertiary structure) or anal- 
ogous to a specific residue or portion of that residue in Badllus amy/oZ/qraefac/ens subtilisin (I.e., having the same or 
similar functional capacity to combine, react, or interact chemically). 

[0026] In order to establish homology to primary structure, the amino acid sequence of a precursor carbonyl hydrolase 
is directly compared to the Badflus amy/o//guefec/ens subtilisin primary sequence and particularly to a set of residues 

IS i<nownto be invariant in subtilisinsforwhich sequence is l<nown. Fig. 2 herein shows the conserved residues as between 
B. amylofiquefadens subt\\\s\r\ and B. lentus subtilisin. After aligning the conserved residues, aiiowing for necessary 
insertions and deletions in order to maintain alignment (i.e., avoiding the elimination of conserved residues through 
arbitrary deletion and Insertion), the residues equivalent to particular amino acids In the primary sequence of Badtius 
amyioliquefadens subtilisin are defined. Alignment of conserved residues preferably should conserve 100% of such 

20 residues. However, alignment of greater than 75% or as little as 50% of conserved residues is also adequate to define 
equivalent residues, Consen/atlon of the catalytic triad, Asp32/Hls64/Ser221 should be maintained. 
[0027] Fo r exampi e J n Fig . 3 1 h e am in o ac Id seq uence of s u btii Isi n from Badflus amyioliquefadens, Badllus subtifis, 
Badlfus fichenifbrmis {carfsbergensis) and Bad/Ius lentus are aligned to provide the maximum amount of homology 
between amino acid sequences. A comparison of these sequences shows that there are a number of conserved res- 

25 idues contained in each sequence. These conserved residues (as between BPN' and B. lentus) are identified in Fig. 2. 
[0028] These conserved residues, thus, may be used to define the corresponding equivalent amino acid residues 
of BadUus amylofiquefadens subX\\\s\r\ in other carbonyl hydrolases such as subtilisin from BaciUus lentus (PCT Pub- 
lication No. W089/06279 published July 13, 1989), the preferred subtilisin precursor enzyme herein, or the subtilisin 
referred to as PB92 (EP 0 328 299), which is highly homologous to the preferred Badllus lentus subtilisin. The amino 

30 acid sequences of certain of these subtlllslns are aligned In Figs. 3A and 3B with the sequence of Badllus amyloUq- 
t/efac/ens subtilisin to produce the maximum homology of conserved residues. As can be seen, there are a number of 
deletions in the sequence of Baditus lentus as compared to Badflus amylofiquefadens subtilisin. Thus, for example, 
the equivalent amino acid for Val165 In Badllus amyfoUquekLdens subW\\B\x\ in the other subtlllslns Is Isoleuclnefor B. 
fentus and B. fichen/formis. 

35 [0029] Thus, for example, the amino acid at position +76 is asparagine (N) In both B. amyfotiquefadens and B. lentus 

subtiiisins. In the preferred subtilisin variant of the invention, however, the amino acid equivalent to +76 in BadUus 
amy/oy/qt/e/ac/ens subtilisin is substituted with aspartate (D). A comparison of certain of the amino acid residues iden- 
tified herein for substitution versus the most preferred substitution for each such position Is provided In Table II for 
illustrative purposes. 

40 

Table II 



45 





+76 


+99 


+101 


+103 


+ 104 


+107 


+123 


B. amyfoliquefadens [wild -type) 


N 


D 


S 


Q 


Y 


1 


N 


B. lentus (wild-type) 


N 


S 


s 


S 


V 


1 


N 


iVIost Preferred Substitution 


D 


D 


R 


A 


l/Y 


V 


S 



[0030] Equivalent residues may also be defined by determining homology at the level of tertiary structure for a pre- 
cursor carbonyl hydrolase whose tertiary structure has been determined by x-ray crystallography, Equivalent residues 

50 are defined as those for which the atomic coordinates of two or more of the main chain atoms of a particular amino 
acid residue of the precursor carbonyl hydrolase and Badllus amylollquef adens sub]\\\&m (N on N, CA on CA, C on C 
and O on O) are within 0.1 3nm and preferably 0.1 nm after alignment. Alignment Is achieved after the best model has 
been oriented and positioned to give the maximum overlap of atomic coordinates of non-hydrogen protein atoms of 
the carbonyl hydrolase in question to the Badtius amylofiquefadens subtilisin. The best model is the crystaiiographic 

55 model giving the lowest R factor for experimental diffraction data at the highest resolution available. 
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^ factor = ^ ^ \J. ^ 
Tf,\Fo(h)\ 

5 [0031 1 Equivalent residues which are functionaiiy analogous to a specific residue of Bacillus amylotiquetaciens sub- 
tillsln are defined as those amino acids of the precursor carbonyl hydrolases which may adopt a conformation such 
that they either alter, modify or contribute to protein structure, substrate binding or cataiysis In a manner defined and 
attributed to a specific residue of the Bacillus amyioiiquefaclenssubX\\\s\r\. Further, they are those residues of the pre- 
cursor carbonyl hydrolase (for which a tertiary structure has been obtained by x-ray crystallography) whicti occupy an 

^0 analogous position to the extent that, although the main chain atoms of the given residue may not satisfy the criteria 
of equivalence on the basis of occupying a homologous position, the atomic coordinates of at least two of the side 
chain atoms of the residue lie with 0.1 3nm of the corresponding side chain atoms of &aciiiusamyioUquefacienssu\:i\\\\s\v\. 
The coordinates of the three dimensional structure of Bacillus amy/o//qfi/efec/e/?ssubtilisln are set forth in EPO Publi- 
cation No. 0 251 446 (equivalent to US Patent Application SN 08/212,291, the disclosure of which is incorporated 
herein by reference) and can be used as outlined above to determine equivalent residues on the ievei of tertiary struc- 
ture. 

[0032] Some of the residues identified for substitution, insertion or deletion are conserved residues whereas others 
are not. In the case of residues which are not conserved^ the replacement of one or more amino acids Is limited to 
substitutions which produce a variant which has an amino acid sequence that does not correspond to one found in 
20 nature. In the case of conserved residues, such replacements should not result in a naturally- occurring sequence. The 
carbonyl hydrolase variants of the present Invention Include the mature forms of carbonyl hydrolase variants, as weii 
as the pro- and prepro-forms of such hydrolase variants. The prepro-forms are the preferred construction since this 
facilitates the expression, secretion and maturation of the carbonyl hydrolase variants. 

[0033] "Prosequence" refers to a sequence of amino acids bound to the N-tennlnal portion of the mature form of a 
25 carbonyl hydrolase which when removed results In the appearance of the "mature" form of the carbonyi hydrolase. 
Many proteolytic enzymes are found in nature as translational proenzyme products and, In the absence of post-trans- 
iationai processing, are expressed in this fashion. A preferred prosequence for producing carbonyl hydro ase variants, 
specifically subtlllsin variants, Is the putative prosequence of Bacillus amylotiquetaciens subX\\\s\r\, aithough other sub- 
tlllsln prosequences may be used. In Examples 1 and 2 the putative prosequence from the subtlllsin from Bacillus 
30 tentus (ATCC 21 536) was used. 

[0034] A "signal sequence" or "presequence" refers to any sequence of amino acids bound to the N-terminal portion 
of a carbonyl hydrolase or to the N-terminal portion of a pro hydrolase which may participate in the secretion of the 
mature or pro forms of the hydrolase. This definition of signal sequence Is afunctional one, meant to Include all those 
amino acid sequences encoded by the N-terminal portion of the subtilisin gene or other secretable carbonyl hydrolases 
35 which participate in the effectuation of the secretion of subtlllsin or other carbonyl hydrolases under native conditions. 
The present invention utilizes such sequences to effect the secretion of the carbonyl hydrolase variants as defined 
herein, A preferred signal sequence used In the Examples comprises the first seven amino acid residues of the signal 
sequence from Bacillus subttlis subx\\\s\n fused to the remainder of the signal sequence of the subtlllsin from Bacillus 
ientus (ATCC 21536). 

40 [0035] A "prepro" form of a carbonyl hydrolase variant consists of the mature form of the hydrolase having a prose- 
quence operably llni<ed to the amino terminus of the hydrolase and a "pre" or "signal" sequence operably linked to the 
amino tennlnus of the prosequence. 

[0036] "Expression vector" refers to a DNA construct containing a DNA sequence which is operably linl<ed to a 
suitable control sequence capable of effecting the expression of said DNA in a suitable host. Such control sequences 
45 Include a promoter to effect transcription, an optional operator sequence to control such transcription^ a sequence 
encoding suitable mRNA ribosome binding sites and sequences which control termination of transcription and trans- 
lation. The vector may be a plasmid, a phage particle, or simply a potential genomic insert. Once transformed into a 
suitable host, the vector may replicate and function Independently of the host genome, or may, In some Instances, 
integrate into the genome itself, in the present specification, "plasmid" and "vector" are sometimes used Interchange- 
so ably as the plasmid is the most commonly used form of vector at present. However, the Invention is intended to include 
such other forms of expression vectors which serve equivalent functions and which are, or become, known In the art. 
[0037] The "host ceils" used In the present Invention generally are procaryotic or eucaryotic hosts which preferably 
have been manipulated by the methods disclosed in US Patent 4,760,025 (RE 34,606) to render them incapable of 
secreting enzymatically active endoprotease. A preferred host cell for expressing subtilisin is the Bacillus strain BG2036 
55 which is deficient in enzymatically active neutral protease and all<aline protease (subtilisin). The construction of strain 
BG2036 is described in detail in US Patent 5,264,366. Other host cells for expressing subtilisin include Bacillus subtilis 
1168 (also described In US Patent 4,760,025 (RE 34,606) and US Patent 5,264,366, the disclosure of which are Incor- 
porated herein by reference), as well as any suitable Bacillus strain such as B. Ilctieniformis, B. ientus, etc. 
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[0038] Host cells are transformed or transfected with vectors constructed using recombinant DNA techniques. Such 
transformed host cells are capable of either replicating vectors encoding the carbonyl hydrolase variants or expressing 
the desired carbonyl hydrolase variant. In the case of vectors which encode the pre- or prepro-fonn of the carbonyl 
hydrolase variant, such variants, when expressed, are typically secreted from the host ceil into the host ceii medium, 

5 [0039] "Operably linked, " when describing the relationship between two DNA regions, simply means that they are 
functionally related to each other. For example, apresequence Is operably linked to a peptide If it functions as a signal 
sequence, participating in the secretion of the mature form of the protein most probably involving cleavage of the signal 
sequence. A promoter Is operably linked to a coding sequence if It controls the transcription of the sequence; a ribosome 
binding site Is operably linked to a coding sequence if it is positioned so as to permit translation. 

10 [0040] The genes encoding the naturally-occurring precursor carbonyl hydrolase may be obtained in accord with the 
general methods known to those skilled in the art. The methods generally comprise synthesizing labeled probes having 
putative sequences encoding regions of the hydrolase of interest, preparing genomic libraries from organisms express- 
ing the hydrolase, and screening the librariesforthegene of interest by hybridization to the probes. Positively hybridizing 
clones are then mapped and sequenced, The B. fentusgene used in the Examples was cloned as described in Example 

f5 1 of US Patent 5, 185,258, the disclosure of which is incorporated herein. The BPN' gene used in Example 5 was cloned 
as described in Example 1 in RE 34,606, the disclosure of which is incorporated herein. 

[0041] The cloned carbonyl hydrolase is then used to transform a host cell in order to express the hydrolase. The 
hydrolase gene Is then llgated Into a high copy number plasmld. This plasmid replicates In hosts In the sense that It 

contains the well-known elements necessary for plasmid replication: a promoter operably linked to the gene in question 
20 (which may be supplied as the gene's own homologous promoter if it is recognized, i.e., transcribed, by the host), a 
transcription termination and poiyadenylation region (necessary for stability of the mRN A transcribed by the host from 
the hydrolase gene In certain eucaryotic host cells) which is exogenous or is supplied by the endogenous terminator 
region of the hydrolase gene and, desirably, a selection gene such as an antibiotic resistance gene that enables con- 
tinuous cultural maintenance of plasm id -infected host cells by growth in antibiotic-containing media. High copy number 
25 plasmids also contain an origin of replication forthe host, thereby enabling large numbers of plasmids to be generated 
in the cytoplasm without chromosomal limitations. However, it is within the scope herein to integrate multiple copies 
of the hydrolase gene into host genome. This is facilitated by procaryotic and eucaryotic organisms which are partic- 
ularly susceptible to homologous recombination. 

[0042] The genes used in the present examples are a natural B. lentus gene and a natural B. amyloUquefaciens 
30 gene. Alternatively, a synthetic gene encoding a naturally- occurring or mutant precursor carbonyl hydrolase (subtlllsin) 
may be produced, in such an approach, the DNA and/or amino acid sequence of the precursor hydrolase (subtiiisin) 
is determined. Multiple, overlapping synthetic single-stranded DNA fragments are thereafter synthesized, which upon 
hybridization and ligation produce a synthetic DNA encoding the precursor hydrolase, An example of synthetic gene 
construction is set forth in Example 3 of US Patent 5,204,015, the disclosure of which is incorporated herein by refer- 
35 ence. 

[0043] Once the naturally-occurring or synthetic precursor carbonyl hydrolase gene has been cloned, a number of 
modifications are undertaken to enhance the use of the gene beyond synthesis of the naturally-occurring precursor 
carbonyl hydrolase. Such modifications include the production of recombinant carbonyl hydrolases as disclosed in US 
Patent 4,760,025 (RE 34,606) and EPO Publication No. 0 251 446 and the production of carbonyl hydrolase variants 
40 described herein. 

[0044] The following cassette mutagenesis method may be used to facilitate the constmctlon and identification of 
the carbonyl hydrolase variants of the present Invention, although other methods Including site-directed mutagenesis 
may be used. First, the naturally-occurring gene encoding the hydrolase is obtained and sequenced in whole or in part. 
Then the sequence is scanned for a point at which it is desired to make a mutation (deletion, insertion or substitution) 

45 of one or more amino acids in the encoded enzyme. The sequences flanking this point are evaluated for the presence 
of restriction sites for replacing a short segment of the gene with an oligonucleotide pool which when expressed will 
encode various mutants. Such restriction sites are preferably unique sites within the hydrolase gene so as to facilitate 
the replacement of the gene segment. However, any convenient restriction site which is not overly redundant in the 
hydrolase gene may be used, provided the gene fragments generated by restriction digestion can be reassembled in 

50 proper sequence. If restriction sites are not present at locations within a convenient distance from the selected point 
(from 10 to 15 nucleotides), such sites are generated by substituting nucleotides In the gene In such a fashion that 
neither the reading frame nor the amino acids encoded are changed In the final construction. Mutation of the gene In 
order to change its sequence to conform to the desired sequence is accomplished by M13 primer extension in accord 
with generally known methods. The task of locating suitable flanking regions and evaluating the needed changes to 

55 arrive at two convenient restriction site sequences is made routine by the redundancy of the genetic code, a restriction 
enzyme map of the gene and the large number of different restriction enzymes. Note that if a convenient flanking 
restriction site is available, the above method need be used only in connection with the flanking region which does not 
contain a site. 
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[0045] Once the naturally-occurring DN A or synthetic DNA Is cloned, the restriction sites flanking the positions to be 
mutated are digested with the cognate restriction enzymes and a plurality of end termini -complementary oligonucleotide 
cassettes are llgated Into the gene. The mutagenesis Is simplified by this method because all of the oligonucleotides 
can be synthesized so as to have the same restriction sites, and no synthetic linl<ers are necessary to create the 
5 restriction sites. 

[0046] As used herein, proteolytic activity is defined as the rate of hydrolysis of peptide bonds per milligram of active 
enzyme. Many well known procedures exist for measuring proteolytic activity (K. M. Kalisz, "Microbial Proteinases," 
Advances In Biochemical Engineering/Biotechnology , A. Fiechtered., 1988). In addition to or as an alternative to mod- 
ified proteoiytic activity, the variant enzymes of the present Invention may have other modified properties such as K^, 
^0 kca^, kcai/K^i ratio and/or mod If led substrate specificity and/or modified pH activity profile. These enzymes can be tailored 
for the particular substrate which is anticipated to be present, for example, in the preparation of peptides or for hydrolytic 
processes such as laundry uses. 

[0047] in one aspect of the invention the objective is to secure a variant carbonyi hydrolase having altered proteolytic 

activity as compared to the precursor carbonyi hydrolase, since Increasing such activity (numerically larger) enables 
the use of the enzyme to more efflclentiy act on a target substrate. Also of Interest are variant enzymes having altered 
thermal stability and/or altered substrate specificity as compared to the precursor. Preferably the carbonyi hydrolase 
to be mutated is a subtiiisin. Specific amino acids useful to obtain such results In subtllisin-type carbonyi hydrolases 
at residues equivalent to +76, +99, +101 , +103, +104, +107, +123 +27, +105, +109^ +126, +128, +135, +156^ +166, 
+1 95. +1 97, +204. +206. +21 0, +21 6, +21 7, +218, +222, +260, +265 and/or +274 or any combination thereof in Bacillus 

20 amyioliquefaciens subtiiisin are presented In the Examples. In some instances, lower proteolytic activity may be de- 
sirable, for example a decrease In proteolytic activity would be useful where the synthetic activity of the carbonyi hy- 
drolases is desired (as for synthesizing peptides). One may wish to decrease this proteolytic activity, which is capable 
of destroying the product of such synthesis. Conversely, in some instances it may be desirable to increase the proteolytic 
activity of the variant enzyme versus its precursor. Additionally, increases or decreases (alteration) of the stability of 

25 the variant, whether alkaline or thermal stability may be desirable, increases or decreases in k^af or ^cs\/K^ are 
specific to the substrate used to determine these kinetic parameters. 

[0048] in another aspect of the invention, it has been determined that residues equivalent to +76 In combination with 
a number of other modifications In subtiiisin are Important In modulating overall stability and/or proteolytic activity of 
the enzyme. Thus, as set forth In the Examples, the Asparaglne (N) in Baciflus lentus subtiiisin at equivalent position 
30 +76 can be substituted with Aspartate (D) In the prefen-ed embodiment In combination with modification of one or more 

of the following amino acid residues +99, +101 , +103, +104, +107, +123, +27, +105, +109, +126, +128, +135, +156, 
+1 66, +1 95, +1 97. +204, +206, +21 0, +21 6, +21 7. +21 8, +222, +260, +265 and/or +274, to produce enhanced stability 
and/or enhanced activity of the resulting mutant enzyme. 

[0049] The most preferred embodiments of the invention are set forth in the Examples. These include the following 
35 specific combinations of substituted residues: N76D/S99D; N76DA/104i; N76D/S99DA/104i; N76D/S103AA/1 041; 
N76D/V1 041/11 07V; N76D/V1 04Y/I1 07V and N76D/S1 01 R/S1 03AiV1 041. Also described In the Examples are all mu- 
tant combinations claimed in the present invention. These substitutions are preferably mads in Bacilfus lontus (recom- 
binant or native-type) subtiiisin, although the substitutions may be made in any Sac///t/s subtiiisin. 
[0050] Based on the results obtained with this and other variant subtilislns, it is apparent that residues in carbonyi 
^0 hydrolases (preferably subtiiisin) equivalent to positions +76^ +99, +101 , +103, +104, +107, +123, +27, +105, +109, 
+126, +128, +135, +156, +166, +195, +197, +204, +206, +210, +216, +217, +218, +222, +260, +265 and/or +274 In 
Bacflfus amytofiquefaciens are important to the proteolytic activity, performance and/or stability of these enzymes and 
the cleaning or wash performance of such variant enzymes. 

[0051 ] Many of the carbonyi hydrolase variants of the invention, especially subtiiisin, are useful in formulating various 
detergent compositions. A number of known compounds are suitable surfactants useful in compositions comprising 
the carbonyi hydrolase mutants of the invention. These include nonlonic, anionic, cationic, anionic or zwitterionic de- 
tergents, as disclosed in US 4,404,1 28 to Barry J. Anderson and US 4,261 ,868 to Jiri Flora, et al. A suitable detergent 
formulation Is that described In Example 7 of US Patent 5,204,015 (previously Incorporated by reference), The art Is 
familiar with the different formulations which can be used as cleaning compositions. In addition to typical cleaning 

50 compositions. It is readily understood that the subtiiisin variants of the present Invention may be used for any purpose 
that native or wild-type subtilislns are used. Thus, these variants can be used, for example, in bar or liquid soap appli- 
cations, dishcare formulations, contact lens cleaning solutions or products, peptide hydrolysis, waste treatment, textile 
applications, as fusion-cleavage enzymes in protein production, etc, The variants of the present Invention may comprise 
enhanced performance in a detergent composition (as compared to the precursor). As used herein, enhanced per- 

55 formance in a detergent is defined as increasing cleaning of certain enzyme sensitive stains such as grass or blood, 
as detennined by usual evaluation after a standard wash cycle, 

[0052] Subtilislns of the invention can be formulated into known powdered and liquid detergents having plH between 
6.5 and 12.0 at levels of about .01 to about 5% (preferably .1% to .5%) by weight. These detergent cleaning compo- 
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sitlons can also Include other enzymes such as known proteases, amylases, oellulases. lipases or endoglycosldases, 
as well as builders and stabilizers. 

[0053] The addition of subtilisins of the invention to conventional cleaning compositions does not create any special 
use limitation. 

5 I n other words, any temperature and pH suitable for the detergent is also suitable for the present compositions as long 
as the pH is within the above range, and the temperature Is below the described subtlllsln's denaturing temperature. 
In addition, subtilisins of the invention can be used in a cleaning composition without detergents, again either alone or 
In combination with builders and stabilizers. 

[0054] The following Is presented by way of example and Is not to be construed as a limitation to the scope of the 

10 claims. 

Example 1 

Construction for the Expression of GG36 Gene in B. subtitis 

IS 

[0055] The cloning and the construction for expression of the subtilisin gene from B. lentus was performed essentially 
the same as that described in US Patent 5,185,258. The piasmid GGA274 (described in Fig. 4 herein) was further 
modified in the following manner, as shown In Fig. 5. The Psti site that was introduced during the construction of the 
GGA274 piasmid was removed by the oilgonucieotide directed mutagenesis described below, with an oligonucleotide 

20 havingthefoilowirg sequence: 5' GAAGCTGCAACTCGTTAAA3' (Seq, ID No.1). The underlined "A" residue eliminated 
the recognition sequence of restriction enzyme PstI and changed the corresponding amino acid residue from alanine 
to threonine at position 274. Threonine at position 274 is the wild-type residue originally found in the cloned B. lentus 
subtilisin gene sequences. The DNA segment encoding subtilisin was excised from the piasmid GGA274 or its deriv- 
atives (GGT274 shown in Fig. 5) by EcoRI and BamHi digest. The DNA fragment was subcloned back into Bacterl- 

25 ophage iVllS-based vectors, such as IV1P19, for mutagenesis. After mutagenesis, the EcoRI and Hindi II digest, followed 
by cloning, were performed to move the mutated subtilisin gene back into an expression piasmid like GGA2 74 for the 
expression and the recovery of mutated subtilisin proteins. 

Example 2 

30 

QllaQnucleotlde-Dlrected Mutagenesis 

[0056] Oligonucleotlde-dlrected mutagenesis was perfonned as described in Zolier, iVl,, et al, (1983), Methods En- 
zymol. , 100:468-500. As an example, a synthetic oligonucleotide of the sequence 5' GGTGCTCTAGACAATTGG 3' 
35 (Seq. ID No.2) was used to change the amino acid residue at position 76 from asparaglne (N) to aspartic acid (D), or 
N76D. The underiined "G" and "C" residues denote changes from the wild-type gene sequence. The CA keeps the 
leucine at position +75 and changes the amino acid sequence to introduce an Xbai recognition site of the Xbai restriction 
enzyme (TCTAGA), while the change at GAC changes asparagine at +76 to aspartate. 

[0057] For mutagenesis at positions 99, 1 01 , 1 03 and 1 04^ different oligonucleotides can be used depending on the 
^0 combination of mutations desired. For example, an oligonucleotide of the sequence 5' GTATTAGGGGCGGACGGTC- 
GAGGCGGCATCAGGTCGATT 3'{Seq. ID No.3) was used to simultaneously make the following changes: S99D; 
SI 01 R; SI 03A and VI 041 in a single subtilisin molecule. Similarly, oligonucleotides of the sequence 5' TCAGGTTCG- 
GTCTCGAGCGTTGCCCAAGGATTG 3' (Seq. ID No.4) and 5' CACGTTGCTAGCTTGAGTTTAG 3' (Seq. ID No.5) 
were utilized to generate 1107V and N123S, respectively. Again, the underiined residues denote changes from wild- 
type sequences which produced desired changes either in amino acid sequences or restriction enzyme recognition 
sequences. 

Example 3 

50 Proteolytic Activity of Subtilisin Variants 

[0058] Following the methods of Example 2, the variants listed In Table III were made. Proteolytic activity of each of 

these subtilisin variants Is shown In Table ill. The kinetic parameters k^^f K|y, and k^,a/K|y were measured for hydrolysis 
of the synthetic peptide substrate succinyl-L-Ala-L-Aia-L-Pro-L-Phe-p-nitroaniilde using the method described In P. 
55 Bonneau, et al. (1991) J. Am. Chem. Soc , Vol. 113, No. 3, p. 1030. Briefly, a small aliquot of subtilisin variant stock 
solution was added to a 1 cm cuvette containing substrate dissolved in O.IIVI Tris-IHGL buffer, pH 8.6, and thermostated 
at 25**C, The reaction progress was followed spectrophotometrlcally by monitoring the absorbance of the reaction 
product p-nitroanlllne at 410 nm. Kinetic parameters were obtained by using a non-linear regression algorithm to fit 
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the reaction velocity and product concentration for. each reaction to the MIchaells-Menten equation. 



Table III 



Kinetc Parameters k^.^^, K,^ and k^at^Kj^ Measured for Baaiius lentus Subti isin and Variants 








'\»I''^M IS 'VI } 


B. /enfusSubtilisin 


170 


0.00078 


2.18x105 


N76D 


219 


O.OOOS 


2.74x105 


N76D/S99D 


88 


0.00061 


1.44X105 


N76D/S103A 


400 


0.0014 


2.86x105 


N7SDA/104I 


459 


0.0011 


4.17x105 


N76D/I107V 


219 


0.0011 


1.99x105 


N76D/N123S 


115 


0.0018 


6.40x104 


N76D/S99D/S101R 


146 


0.00038 


3.84x105 


N76D/S99D/S103A 


157 


0.0012 


1.31x105 


N75D/S99D/V1 041 


247 


0.00097 


2.55x105 


N76D/S101R/S103A 


405 


0.00069 


5.90x105 


N7SD/S101R/V104I 


540 


0.00049 


1.10x106 


N76D/S103AA/104I 


832 


0.0016 


5.20x105 


N76DA/1 041/11 07V 


497 


0.00045 


1.10x106 


N76DA/104Y/I107V 


330 


0.00017 


1.90x10^ 


l\/OLJ/V WJHuW 


1 




v7.DOX 1 u 


N76D/I107V/N123S 


147 


0.0035 


4.20x104 


N76D/S99D/S1 01 R/S1 03A 


242 


0.00074 


3.27x105 


N76D/S99D/S1 01 FW1 041 


403 


0.00072 


5.60x105 


N76D/S99D/S1 03A/V1 041 


420 


0.0016 


2.62x105 


N76D/S101R/S103A/V104I 


731 


0.00065 


1.12x10® 


N76D/S1 03AA/1 041/N 1 23S 


321 


0.0026 


1.23x105 


N76DA/1 041/1 1 07V/N 1 23S 


231 


0.003 


7.70x10' 


N7BD/S99D/S1 01 R/S1 03A/V1 041 


624 


0.00098 


6.37x105 


N76D/S99D/S1 03A/V1 041/N 1 23S 


194 


0.0043 


4.51x104 


N76D/S99DyS1 01 R/S1 03AA/1 041/N 1 23S 


311 


0.0023 


1.35x105 



[0059] The results listed In Table III Indicate that all of thesubtlllsin variants tested retain proteolytic activity. Further, 
detailed analysis of the data reveal that proteolytic activity was significantly altered for Bacillus lentus subtilisin by the 
various combinations of substitutions at annino acid residues equivalent to positions 76, 99^ 101^ 103, 104, 107 and 
123 in Bacitlus stmylotiquefadens. 

Example 4 

Thermal Stability of Subtilisin Variants 

[0060] A comparison of thermal stability observed for 6acy//L/5/enrL/s&ubtilisin and the variants of the present Invention 
made by the process of Example 2 is shown in Table IV. Purified enzyme, 15 ug/ml in 0.1 M glycine 0.01% Tween-SO 
pH 10.0, witti or without 50 miVI CaClg, was aliquotted into small tubes and incubated at lO^'C for 5 minutes, 10°C to 
BO^'C over 1 minute, and 60°C for 20 minutes. Tubes were then placed on ice for 1 0 minutes. Aliquots from the tubes 
were assayed for enzyme activity by addition to 1 cm cuvettes containing 1.2 mM of the synthetic peptide substrate 
succinyl-L-ala-L-Aia-L-Pro-L-Phe-p-nitroanilide dissolved in 0.1 U tris-HCL buffer^ pH 8.6, thermostatted at 25°C. The 
initial linear reaction velocity was followed spectrophotometricalty by monitoring the absorbance of the reaction product 
p-nltroanlilne at 41 0 nm as a function of time. Data are presented as percent activity prior to heating. The results listed 
in Table IV indicate that a vast majority of variants exhibit thermal stability comparable to Baciltus /e/?ft/s subtilisin (24 
out of 26) in the test condition with 50mM CaClg added, In the test condition without 50mM CaClj added, a vast majority 
of variants (19 out of 26) are significantly more stable than Bacillus lentus subtilisin, Further, the variants N76D/S99D, 
N76D/V104I, N76D/S99DA/104I, N76D/S1 03AA/1 041, N 76 DA/ 1 041/11 07V, N76D/V104Y/I107V and N76D/S101R/ 
SI 03A/V1 041 are significantly more stable than the single substitution variant N76D in the test condition without 50mM 
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GaGIg added. 



Table IV 



ThBrmal Stability Measured for Bacillus fentus SubWWsln and Variants At pH 10, 60-'C, +/-50mM CaCl2 Added 


Enzyme 


% Initial Activity Remaining 




- CaCl2 


-1- CaClg 


B. /emi/sSubtlllsIn 


2 


96 


N76D 


34 


97 


N76D/S99D 


49 


98 


N76D/S103A 


26 


92 


N76 DA/1 041 


58 


98 


N76D/I107V 


32 


96 


N76D/N123S 


0 


97 


N76D/S99D/S101R 


30 


100 


N76D/S99D/S103A 


36 


100 


N76D/S99DA/104I 


48 


97 


N76D/S101R/S103A 


26 


100 


N76D/S101RA/104I 


38 


100 


N76D/S103AA/104I 


68 


100 


N76D/V1 041/11 07V 


60 


97 


N76D/V104Yyi107V 


48 


74 


M7ftnA/i nyi i/ivn oqc 


0 


98 


N76D/I107V/N123S 


16 


100 


N76 D/S99D/S 1 0 1 R/S 1 03 A 


38 


100 


N76 D/S99D/S 1 0 1 R/V1 04 1 


33 


100 


N76D/S99D/S 1 03A/V1 04! 


38 


98 


N76D/S1 01 R/S 1 03AA/1 041 


40 


99 


N76D/S1 03A/V1 041/N 1 23S 


1 


98 


N76DA/1 041/11 07V/N1 23S 


3 


99 


N76D/S99D/S1 01 R/S1 03A/V1 041 


36 


99 


N76 D/S99D/S 1 03AA/1 041/N 1 23S 


2 


95 


N76D/S99D/S1 01 R/S1 03/ii/V1 04I/N123S 


0 


100 



Examples 5 



Qllaonucl5Qtide-Dlrected Mutagenesis with Single-Stranded DNA Template Generated from Phagemid 
A. Construction of B. /en fug Variants 

[0061] Tine mutagenesis protocol was essentially the same as described above in Example 2. The single-stranded 
DNA template was generated by phagemid method. To construct the p in agemid vector for generating the single-strand- 
ed DNAtemplate we first constructed the vector pBCDAICAT The flow chart of vector construction is outlined in Figure 
8. First, the C/a/ to C/a/ fragment encoding the CAT gene from pC194 plasmid (Horlnouchi, S. and Weisblum, B., J. 
BactehoL , 150:8-15, 1 982) was cloned into the AccIsWe of polylinker region of pUCI 9 (New England BioLabs, Beverly, 
MA) to make plasmid pUGCHL and the EcoRI-Drat 0.6 KB fragment from the 5' end of the GG36DAI encoding DNA 
was cloned into the EcoRI and Ecofll/sites of pBSKS plasmid (Stratagene, inc^ San Diego, CA) to make pBC2SK5. 
The single EcoRI sWe of the plasmid pBC2SK5 was eliminated by Eco/?/ digestion, followed by filling in catalyzed-by- 
T4 DNA polymerase, and religation to generate the plasmid pBG2SK-5R which does not have the EcoRI site. The 
EcoRI-Dral fragment which was cloned Into pBGSK-5R was isolated as a P5f/-HM/// fragment and cloned into the 
Pstl-Hindin site of the pUCCHL (part of the polylinker of pUC1 9) to generate plasmid pUCCHLSR. The encoding se- 
quence of GG36DAI gene was excised as an EcoRi-BamHf fragment and cloned into the EcoRt-BamHi sites of 
pUCCHLSR to make pUCCAT The large EcoRI-Hindififragment of pUCCAT was then cloned into the EcoRI and Hindill 
sites of BS2KS-I- to generate the plasmid pBCDAICAT. 
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[0062] To generate single-stranded DNA. E. coll-containing pBCDAICAT were Infected with pliage R408 (obtained 
from Stratagene, San Diego, GA) foiiowing tiie protocol described in Russei, M., Kidd, S. and Keiiey, iVi.R., GENE 45: 
333-338, 1986. Once the single-stranded DNA template was available, standard mutagenesis methods as described 
above In Example 2 were carried out. The construction of certain mutants is detailed below for illustrative purposes. 
5 [00631 For the construction of a lentus (GG36) N76D/S1 03AA/1 04I/L21 7H, an EcoHl-BamHi DNA fragment encod- 
ing GG36 N76D/S103A/V104i was used In the construction of pUCCAT (see Fig. 8) to generate the piasmid pBCDA- 
ICAT. After the single-stranded DNA template was made following the protocol described above, a mutagenesis primer 
with the foiiowing sequence 

10 

5' TAT GCC AGC CAC AAC GGT ACT. TCG ATG GCT 3" (Seq. ID No. 13) 

was used to make the L217H. As before, the underlined residues denote the nucleotide changes that were made and 
IS thex C/a/ indicates that the existing C/a/site was eliminated after the mutagenesis. The mutagenesis protocol was as 
described In Example 2. Afterthe mutagenesis, piasmid DNA was first screened for the elimination of the C/a/site and 
those clones missing the Clal site were subjected to DNA sequence analysis to verify the DNA sequence which made 
the L to H change at the 21 7th amino acid residue. 

20 B. Construction of BPN' Variants and their Expression in B. subtiHs 

[0064] The construction of S. amyfoUquefaciens (BPN') N76D/Q1 03AA'104I/Y217L was done In a similar fashion 
except in two consecutive steps. N76D was first introduced into BPN' Y217Lto mai<e BPN' N76DA^21 7L and a second 
mutagenesis was done to convert BPN' N76D/Y217L to BPN' N76D/Q103A/Y104iA^217L. To generate the singie- 
25 stranded DNA template for the first mutagenesis, an Ecofl/-Ba/nH/ fragment encoding BPN' Y217Lsubtlilsin (derived 
from the Y217L piasmid described In Wells, J., et al., PNAS, 84, 5167, 1087) was used to construct a piasmid pUC- 
CATFNA (see Fig. 9). The pUCCATFNA piasmid containing BPN' Y21 7L was used to construct the pBCFNACAT pias- 
mid (Fig. 9). Single-stranded DNA was generated as described above. To generate BPN' N76D/Y217L, an oligonucle- 
otide primer with the sequence 

30 

* *** ** X XhaZ 
5 ' C ACA GTT GCG GCT CTA gAT AAC TCA ATC GGT G 3 ' (Seq. ID 
NO. 14) 

35 

was used to generate the change N76D. Single-stranded DNA was then prepared from the pBCFNACAT piasmid 
containing the BPN' N76D/Y217L (the pBCFNACAT piasmid after N76D mutagenesis) and mutagenized with another 
oligonucleotide with the sequence 

40 

* *** ** X PvuIX 
5' GCT GAG GGT TCC GGC GCT ATT AGT TGG ATC ATT 3* (Seq. ID 
No, 15) 

to obtain BPN' N76D/Q1 OSAA'1 04I/Y21 7L. All steps Involved in the cloning, the single-stranded DNA preparation, the 
mutagenesis, and the screening for mutants were carried out as described above. 

[0065] Expression of the BPN' gene and its variants were achieved by Integrating piasmid DNA (pBCFNACAT con- 
taining the different variants' BPN' gene) directly into a protease-deficient strain of Bacillus subtilis as described in RE 
34,606. 

50 [0066] Numerous variants were made as per the teachings of Examples 2 and 5. Kinetics data and stability data 
were generated for such variants. The kinetics data were generated using the methods described In Example 3 and 
are provided In Table V. The stability data were generated as detailed herein. Results are shown in Table Vi. 

Thermal Stability Assay Procedure 

55 

[0067] Purified enzyme was buffer-exchanged into 0.1 iVi glycine pH 1 0.0, 0.01 % Tween-80 by applying the enzyme 
to a column consisting of Sephadex G-25 equilibrated with this buffer and eiuting the enzyme from the column using 
the same buffer. 
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[0068] To a tube containing 0.1 iVI giycine, 0.01% Tween-80 piH 10.0 tiiemnostatted at 60°C. the buffer-excliangecl 
enzynne was added to give a final enzyme concentration of 1 5 ug/nni. 

[0069] Aliq uots were removed from tlie 60^C incubation at various times and immediately assayed for enzyme activity 
by addition to a 1 cm cuvette containing 1 .2 mlVI of tine synthetic peptide substrate succinyl-L-Ala-L-Ala-L-Pro-L-Phe- 
5 p-nitroanllide dissolved In 0, 1 M tris-HCL buffer, pH 8.6, thermostatted at 25"'C. The Initial linear reaction velocity was 
followed spectrophotometricaliy by monitoring the absorbance of the reaction product p-nitroaniiine at 410 nm as a 
function of time. 

[0070] Half -life, which Is the length of time required for 50% enzyme Inactlvatlon, was determined from the first-order 
plot of reaction velocity as a function of the time of Incubation at 50°C. 
^0 [0071] The data are presented In Table VI as percent of the half-life determined for Baci/lus /e/ift/ssubtllisin (GG36) 
under identical conditions. 



Table V 





Enzyme 


kcat 


KM 


kcat/KM 


IS 




(s-1) 


(mM) 


(s-iM-i) 




D. lenius suoiiiisin 


1 70 


n 7fi 


o one 1 ni^ 




IM f OLJ/o1 UOVJI/ V \ U^l 


OoU 


1 j4 
\ .H- 






MTRriVQH HQ A Ay H flAC 
IM f Q U/o 1 UOM/V 1 U*l-r 




n 

u.oo 


o onc_i_nft 




IM oU/Ol Uo/VVI UH"IN 


7Qn 




o finc_Lnix 




IM / DU/o 1 UOM/VI Uh-S 


1 7n 


u.oo 


■5 nnp_i_nf^ 

^.UU t-l-UO 




M"7RnyQH nQAA/i r\A~r 

IN f DLJ/ol UoM/V 1 Uh- I 


o/u 


1 Q 

1 .y 


o nnc 1 




IN / QU/o 1 U^3M/ V 1 Uh-vW 


RAO 


U.o 1 


o ftnPj_nft 


25 


In / 0 U/ o \ Uo/VV 1 UH" t 


oyu 


n 


1 iinc_i_ntf5 
1 .*HJt-i-Uo 


r<£.t rVINf DU/ V 1 Um-T/INI ^OO 


ouu 


1 






IN f QU/O 1 U1 u/ol UofvVI U^l 


Ron 


1 Q 


H-.tSUt+UO 




In /OU/O 1 Uo/VV 1 Uh-I/o 1 UDrt 




1 Q 


M-.^UC-|-U3 




M7RrbyQi ri'^ AA/i riAiyQi nf^n* 
IN fouyo 1 ujrt/v 1 u'nyoi udu 




■1 7 




30 


M7Rnyd AA/1 n>i"r/M n7A* 

IN f OlJ/ol Uo/VVl Uh- I / n \}f f\ 


H on 

1 


C 7 
O./ 


o 1 nc_i_n/l 




M7f5rsyQi HQ A A/1 n/1X/M n7l * 

IN / b u/ o 1 u o/v v 1 Uh- I / n u / l 


Tin 

o1 U 


Q O 


Q 7nc 1 nyi 




M7RriyQi riQ AA/i riAiyi iora 


Qn 

yu 




/I 1 nc_i_n.d. 




M7RnyQ1 riQAA/i fiAiyi lOftf 


i nn 


1 Q 

1 .y 


Q KnPj_n.^ 


35 


W7RnyQi n*^ A A/1 n/Liyi i^ri 


inn 

1 uu 


O A 




N76D/S1 03A/V1 04I/L126V 


64 


3.2 


2.00E+04 




N76D/S1 03AA/1 041/31 28G* 


560 


1.7 


3.30E+05 




N76D/S1 03AA/1 04I/S1 28L* 


430 


3.8 


1.10E+05 




N76D/S1 03AA/1 04I/L135A 


140 


0.76 


1.80E+05 


40 


N76D/S1 03AA/1 04iyL135F 


390 


0.69 


5.70E+05 




N76D/S1 03A/V1 04i/L135l 


110 


0.73 


1 .50E+05 




N76D/S1 03AA/1 04i/L135V 


140 


0.86 


1.60E+05 




N76D/S1 03AyV1 04I/S1 56E* 


170 


2.6 


6.50E+04 




N76D/S1 D3A/V1 041/31 66D* 


160 


3.5 


4.60E+04 


45 


N76D/S1 03A/V1 04i/D1 97E 


510 


1.4 


3.60E+05 




N76D/S1 OSAA'I 04I/N204A* 


530 


1.1 


4.80E+05 




N76D/31 03AA/1 04I/N204G* 


580 


1.4 


4.10E+05 




N76D/S1 03A/V1 04i/N204C* 


370 


1.3 


2.90E+05 


50 


N76/S103AA/104l/P210r 


500 


1.2 


4.20E+05 




N76D/S1 03A/V1 04I/L21 7H* 


80 


0.63 


1.30E+05 




N76D/S1 03AA/1 04I/IVI222A 


70 


3.1 


2.30E+04 




N76D/S1 03AA/1 04I/M222S 


80 


3.1 


2.60E+04 




N76D/S1 03A/V1 04lyT260P 


660 


1.5 


4.40E+05 


55 


N76D/S1 03AA/1 04iyS265N 


590 


1.3 


4.50E+05 



* These mutants made as per Example 5, all others made as per Example 2 
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Table V (continued) 



Enzyme 


kcat 


KM 


kcaVKM 






(mM) 


(s-"'lV|-'') 


K27R/N76DA/1 04Y/I1 07V/N1 23S 


220 


1.4 


1.60E+05 


K27R/N76DA/1 04Y/N1 23S/D1 97E 


430 


1.1 


3.90E+05 


K27R/N76DA/1 04Y/N123S/N204C 


400 


1.1 


3.60E+05 


K27R/N76DA/1 04Y/N123S/Q206L 


440 


1.2 


3.70E+05 


K27R/N76D/V1 04Y/N1 23S/S21 6V 


440 


1.2 


3.70E-H05 


K27R/N76DA/1 04Y/N1 23S/N21 8S 


760 


0.98 


7.80E+05 


K27R/N76DA/1 04Y/N123S/T260P 


410 


1.2 


3.40E+05 


K27R/N76DA/1 04Y/N1 23S/T274A 


390 


1 


3.90E-H05 


N76D/S1 03AA/1 04I/L126F/S265N 


170 


2.1 


8.10E+04 


N76D/S1 03A/V1 04I/S1 56E/S1 B6D* 


40 


6.3 


6.40E+03 


K27R/N76DA/1 04Y/N1 23S/G1 95E/G1 97E 


410 


0.98 


4.20E+05 


K27R/N76DA/1 04Y/N1 23S/G1 95E/N21 8S 


540 


0.66 


8.20E+05 


K27R/N76DA/1 04Y/N123S/D1 97E/N21 8S 


770 


0.79 


9.80E+05 


K27R/N76DA/1 04Y/N1 23S/N204C/N21 8S 


610 


0.99 


6.20E-I-05 


K27R/N76DA/104Y/N123S/Q206L/N218S 


580 


0.78 


7.40E+05 


K27R/N76DA/1 04Y/N123S/N21 BS/r260P 


660 


1 


6.60E+05 


K27R/N76DA/1 04Y/N1 2-3S/N21 8S/T274A 


590 


0.89 


6.60E-f05 


K27R/N76DA/1 04Y/Q1 09S/N123S/N21 8S^274A 


520 


1 


5.20E+05 


K27R/N76D/V1 04Y/N 1 23S/G1 95E/D 1 97E/N21 BS 


460 


0.65 


7.10E+05 


B. amylollquefaclens subtlllsin (BPN') 


50 


0.14 


3.60E+05 


BPN'-N76D/Y217L* 


380 


0.46 


8.30E+05 



Table VI 



Enzyme 


Thermal Stability 




(% half-life of native 




enzyme) 


B. lentus subtlllsin 


100 


N76D 


590 


N76D/S99D 


840 


N76D/S103A 


390 


N76DA/104I 


660 


N76D/I107V 


710 


N76D/N123S 


70 


N76D/S99D/S101R 


610 


N76D/S99D/S103A 


590 


N76D/S99DA/104I 


910 


N76D/S101R/S103A 


930 


N76D/S101RA/104I 


500 


N76D/S103AA/104I 


460 


N76D/S103GA/104r 


370 


N76D/S103AA/104F 


480 


N76D/S103AA/1 04N 


230 


N76D/S103AA/104S 


230 


N76D/S103AA/104T 


370 


N76D/S103AyV104W 


280 


N76D/S103AA/104Y 


400 



* These mutants made as per Example 5, all others made as per Example 2 
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Table VI (continued) 



Enzyme 


Thermal Stability 




(% half -life of native 




enzyme) 


N76DA/1 041/11 07V 


940 


N76DA/104Y/I107V 


820 


N76DA/104I/N123S 


80 


N76D/I107V/N123S 


150 


K27R/N76DA/1 04YyN1 23S 


100 


N76D/S99D/S1 01 R/S1 03A 


570 


N 76D/S99 D/S 1 0 1 FW1 04 1 


1000 


N76D/S99D/S1 03A/V1 041 


680 


N76D/S101 G/S1 03A/V1 041* 


390 


N76D/S1 01 R/S1 03A/V1 041 


470 


N76D/S1 03AA/1 04I/S1 05A* 


360 


N76D/S1 03AJV^ 04I/S1 05D" 


370 


N76D/S1 03A/V1 04T/I1 07A* 


270 


N76D/S1 03AA/1 04T/I1 07L* 


230 


N76D/S103A/V1 04I/N123S 


110 


N76DA/1 041/11 07 V/N 1 23S 


220 


N76D/S103AA/1 04I/L126A 


270 


N76D/S103A/V104I/L126F 


950 


N76D/S103AA/104I/L126I 


410 


N76D/S103A/V1 04I/L126V 


320 


N76D/S1 03AA/1 04I/S1 28G* 


640 


N76D/S1 03A/V1 04I/S1 28L* 


760 


N76D/S103AyV1 04I/L135A 


230 


N76D/S103AA/104I/L135F 


200 


N76D/S1 03AA/1 04I/L1 351 


510 


N76D/S103AA/1 04I/L135V 


500 


N76D/S1 03AyV1 04I/S1 56E* 


120 


N76D/S1 03AyV1 04I/S1 66D* 


590 


N 760/31 03A/V1 04I/D1 97E 


460 


N76D/S1 03AA/1 04I/N204A^ 


230 


N76D/S1 03A/V1 04I/N204G* 


240 


N76D/S1 03AA/1 04I/N204C* 


500 


N76D/S1 03A>V1 04I/P21 01* 


1370 


N76D/S1 03AA/1 04I/L21 7H* 


60 


N76D/S1 03AA/1 04I/M222A 


520 


N76D/S1 03AA/1 04I/M222S 


490 


N76D/S1 03A/V1 04I/T260P 


490 


N76D/S1 03AA/1 04I/S265N 


360 


K27R/N76D/V1 04Y/I1 07V/N 1 23S 


210 


K27R/N76D/V1 04Y/N1 23S/D1 97E 


120 


K27R/N76D/V1 04Y/N1 23S/N204C 


110 


K27R/N76D/V1 04Y/N1 23S/Q206L 


380 


K27FI/N76D/V1 04Y/N1 23S/S21 eV 


140 


K27R/N76D/V1 04YyN1 23S/N21 BS 


270 


K27R/N76D/V1 04Y/N1 23S/T260P 


40 


K27R/N76D/V1 04Y/N1 23S/T274A 


60 


N76D/S99D/S1 01 R/S1 03AA/1 041 


590 


N76D/S99D/S1 03A/V1 04iyN 1 23S 


110 
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Table VI (continued) 





Enzyme 


Thermal Stability 
(% half -life of native 


5 




enzyme) 




N 76D/S1 03A>V1 04I/L1 26 F/S265 N 


810 




N76D/S1 03AA/1 04I/S1 56E/S1 66D* 


220 




K27R/N76DA/1 04Y/N1 23S/G1 95E/G 1 97E 


90 




K27R/N76DA/1 04Y/N1 23S/G1 95E/N21 8S 


250 


10 


K27R/N76DA/1 04Y/N1 23S/D1 97E/N2 1 8S 


270 




K27R/N76DA/1 04Y/N1 23S/N204G/N21 8S 


460 




K27R/N76DA/1 04Y/N1 23S/Q206L/N21 8S 


1400 




K27R/N76DA/1 04Y/N1 23S/N21 8S/T260P 


310 


IS 


K27R/N76DA/1 04Y/N1 23S/N21 8SvT274A 


180 


N76D/S99D/S 1 01 R/S1 03AA/1 041/N 1 23S 


90 




K27R/N76D/V1 04Y/Q1 09S/N1 23S/N21 8S/T274A 


230 




K27R/N76DA/f 1 04Y/N1 23S/G1 95E/D1 97EyTM21 8S 


240 




B. amyloliquefaciens subtlllsin (BPN') 


100 


20 


BPN'-N76DA^217L* 


420 



Example 6 

Wash Performance Test 

25 

[0072] The wash performance of the variants described in the previous examples was evaluated by measuring the 
removal of stain from EMPA 118 (blood/milk/carbon black on cotton) cloth swatches (Testfabrics, Inc., Middlesex, NJ 
07030). 

[0073] Six ElViPA 11 6 swatches, cut to 3 X 4-1/2 inches with pinked edges, were piaced in each pot of a Modei 7243S 
Terg-O-Tometer (United States Testing Co., Inc., Hoboken, NJ) containing 1000 ml of water, 15 gpg hardness (Ca+-^: 
Mg++::3:1::w:w), 7 g of detergent, and enzyme as appropriate. The detergent base was WFK1 detergent from wfk - 
Testgewebe GmbH, Adierstrasse 42, Postfach 13 07 62, D-47759 Krefeld, Germany: 



35 



40 



Component 


% of Final Formulation 


Zeolite A 


25% 


Sodium sulfate 


25% 


Soda Ash 


10% 


Linear alkylbenzenesuifonate 


8.8% 


Alcohol ethoxylate (7-8 EO) 


4.5% 


Sodium soap 


3% 


Sodium silicate 


3% 


(SI02:Na20::3.3:1) 





45 

[0074] To this base detergent, the following additions were made: 



50 



Component 


% of Final Formulation 


Sodium perborate monohydrate 


13% 


Copolymer (Sokalan CPS) 


4% 


TAED (IVIykon ATC Green) 


3% 


Enzyme 


0,5% 


Brightener (Tinopal AMS-GX) 


0,2% 



[0075] Sodium perborate monohydratewas obtained from Degussa Corporation, RIdgefield-Park, NJ 07660. Sokalan 
CP5 was obtained from BASF Corporation, Parsippany, NJ 07054. Mykon ATC Green (TAED, tetraacetylethylenedi- 
amine) was obtained from Warwick International, Limited, Mostyn, Holywell, Clwyd CHS 9HE, England. Tinopal AMS 
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GX was obtained from Clba-Gelgy Corporation, Greensboro, NO 2741 9. 

[0076] Six EMPA 11 6 swatches were washed In detergent with enzynnefor 30 minutes at 60^*0 and were subsequently 
rinsed twice for 5 minutes each time In 1 000 ml water. Enzymes were added at final concentrations of 0.05 to 1 ppm 
for standard curves, and 0.25 ppm for routine analyses. Swatches were dried and pressed, and the reflectance from 
the swatches was measured using the L value on the L*a*b* scale of a IVilnolta Chroma IVleter, Model CR-200 (Minolta 
Corporation, Ramsey, NJ 07446). Performance is reported as a percentage of the performance of B. tentus (GG36) 
protease and was calculated by dividing the amount of B. fentus (GG36) protease by the amount of variant protease 
that was needed to provide the same stain removal performance X 100. The data are shown In Table VII. 



50 



10 


Table VI 






Enzyme 


Wash 

^) A ivf ,M a n A 

rB rro rrnari ce 




B. lentus subtilisin 


100 


IS 


N76D 


310 




N76D/S103A 


230 




N76D/V104I 


130 




N76D/I107V 


160 




N76D/S99D/S101R 


370 


20 


N76D/S99D/S103A 


290 




N76D/S101R/S103A 


130 




N76D/S101RA/104I 


300 




N76D/S103A/V104I 


320 


25 


N76D/S103GA/1 041 


160 




N76D/S103AA/104F 


210 




N76D/S103AA/104N 


110 




N76D/S103AA/104T 


170 




N76D/V1 041/11 07V 


210 


30 


N76D/S99D/S101R/S103A 


220 




N76D/S99D/S101RA/104i 


140 




N 76D/S 1 0 1 G/S 1 03 AA/ 1 041 


170 




IN /DU/ol Ul ri/ol Uorv V 1 U'H 




35 


N76D/S1 03AA/1 04i/S1 05A 


170 


N76D/S1 03AA/1 04T/I1 07A 


120 




N76D/S1 03A>V1 04T/I1 07L 


110 




N76D/S1 03AA/1 04I/L1 26F 


110 




N 76 D/S 1 03 AA/ 1 04 l/S 1 28G 


280 


40 


N76D/S1 03A/V1 04i/L1 351 


160 




N76D/S1 03/W1 04I/L1 35V 


160 




N 76 D/S 1 03 A/V 1 04 l/D 1 9 7 E 


170 




N76D/S1 03AA/1 04I/N204A 


160 


45 


N76D/S1 03AA/1 04i/N204G 


150 


N76D/S1 03A/V1 04I/P21 01 


470 




N76D/S1 03AA/1 04I/M222A 


100 




N76D/S1 03A/V1 04I/T260P 


280 




N76D/S1 03AA/1 04I/S265N 


190 



Example 7 

Protease Stability in a Liquid Detergent Formulation 



55 



[0077] A comparison of protease stability toward inactlvation In a liquid detergent formulation was made for Baciltus 
/e/?ft/s subtilisin and It's variant enzyme N76D/S1 03A/V1 041 according to the procedure outlined herein. The detergent 
formulation used for the study was a commercially purchased bottle of Tide Ultra liquid lanudry detergent made In the 
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USA by Procter & Gamble Company. Heat treatment of the detergent formulation was necessary to Inactivate In-situ 
protease. This was accomplished by Incubating the detergent at 96°C for a period of 4.5 hours. Concentrated prepa- 
rations of the B. fentus subtillsin and N76D/S103A/V104I variant, in the range of 20 grams/liter enzyme, were then 
added to the heat-treated Tide Ultra at room-temperature to a final concentratrion of 0.3 grams/liter enzyme in the 
detergent fomnuiation. The heat-treated detergent with protease added was then incubated in awater bath thermostat- 
ted at 60**C. Allq Jots were removed from the Incubation tubes at 0, 24, 46, 76, and 1 1 2 hour time Intervals and assayed 
for enzyme activity by addition to a 1 cm cuvette containing 1 .2 mM of the synthetic peptide substrate suc-Ala-Ala-Pro- 
phe-p-nltroanlilde dissolved in 0.1 M trIs-HCL buffer, pH 8.6, and thermostatted at 25''C. The Initial linear reaction ve- 
locity was followed spectrophotometricaliy by monitoring the absorbance of the reaction product p-nitroaniline at 41 Onm 
as a function of time. As shown In Fig. 1 0, the N76D/S1 03AA/1 041 variant was observed to have significantly greater 
stability towards inactivatlon than the native B. tentus enzyme, Estimated haif-iives for inactivation in the Tide Ultra 
detergent formulation for the two enzymes, under the specified test conditions, are 45 hours for B. lentus subtilisin and 
1 25 hours for the N76D/S1 03A/V1 041 variant. 

[0078] Throughout this application reference is made to various amino acids byway of common one- and three-letter 
codes. Such codes are identified In Dale, J.W. (1989), IVIolecular Genetics of Bacteria , John Wiley & Sons, Ltd., Ap- 
pendix B. 

[0079] Although the preferred embodiments of the invention have been described above, it will be obvious to those 
skilled in the art to which the Invention pertains, that, after understanding the Invention as a whole, various changes 
and eq u ivalent modifications may be made without departing from the scope of the invention as defined by the appended 
claims. 
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SEQUENCE LISTING 



10 



20 



50 



55 



(l) GENERAL INFORMATION: 



(i) APPLICANT: Graycar, Thomas P 
Bott, Richard R 
Wilson, Lori J 

(ii) TITLE OF INVEMTIQH: Subtiliain Variants 

(iii) MXn4BER OF SEQVEKCE5: 15 



(iv) CORRESPONDENCE ADDRESS: 

(A> ADDRESSEE: Gen&ncor Internatiotwl , Inc 

(B) STREET: ISO Kimball Way 

(C) CITY; So. San Francisco 
IS <D) STATE: CA 

(E) COIFH^RY: USA 

(F) ZIP: 94060 

<v) COMPUTER READABLE FORM: 

(A> HEDIUM TYPE: Floppy disk 
(B) COMPUTER: IBM PC compatible 
(C> OPERATING SYSTEM: PC-DOS/MS-DOS 
<D) SOFTWARES PatentIn Release Sl.O, Version #1.25 



(Vi) CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER; 
{B) FILING DATEt 13-OCT-199 4 
iC) CLASSIFICATION: 

25 (Viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Horn, Margaret A. 

(B) REGISTRATION NUMBER: 3 3,401 

(C) REFERENCE/ DOCKET NUMBER: GC235-2 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPKOHEt (415) 742-7536 
30 iB) TELEFAX! (415) 742-7217 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 base pairs 
«B> TYPE; nucleic acid 

(C) STRANDEDNESS J single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic] 

40 (3ci) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

GAAGCTOCAA CTCGTTAAA 19 

(2) INFORMATION FOR SEQ ID NO: 2: 

(ii SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 18 ba86 pairs 

(B> TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO; 2: 
GCTGCTCTAG ACAATTCG IB 
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(2) INFORMATION FOR 5EQ ID NO: 3: 

(ij SEQUENCE CHARACTERISTICS: 
[A^ LENGTH: 39 base pairs 
(B) TVPE: nucleic acid 
CO STBANDEDNESSr single 
10> TOPOLOGY: linear 

(ii) MOLECULE TYPE: DEOA. (gaziofflic) 



(Xi) SEQUIINCE DESCRIPTION: SEQ ID NO: 3: 

OTATTAGGGG CGGACGGTCG AGGCGCCATC AGCTCGATT 

(2) INFORMATION FOR SEQ ID NO: 4; 

(i) SEQUENCE CHARACTERISTICS: 
(A^ LENGTH: 33 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY; linear 

<ii) MOLECULE TYPE: DNA (genomic) 



(xi) SSQUENCE DBSCRIETION: S£Q ID N0:4: 

TCAGGTTCGG TCTCGAGCGT TGCCCAAGGA TTG 

(2) INFORMATION FOR SEQ lO NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
1A1 LENGTH: 22 base pairs 
(B^ TYPE; nucleic acid 
{CI STRAKDEDNESS: single 
{D} TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA {genomic) 



(sei) SBQTJENCE DESCRIPTION: SEQ ID MO:S: 

CACOTTGCTA GCTTGAGTTT AG 

(2) INFORMATION FOR SEQ ID NO:6i 

(1) SBQtJENCE CHARACTERISTICS: 

(At LENGTH : 1497 base pairs 

(B) TYPE: nuclolc acid 

(C) STRANDEDHESS E single 
CD> TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA (genomic) 



Ui) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GOTCTACTAA AATATTATTC CATACTATAC AATTAATACA CAGAATAATC TGTCTATTGG 
TTATTCTGCA AATGAAAAAA AQQAGAGQAT AAAOAOTGAG AGOCAAAAAA GTATGGATCA 
GTTTGCTGTT TGCTTTAGCG TTAATCTTTA CGATGGCGTT CGGCAGCACA TCCTCTGCCC 
AGGCGGCAGG GAAATCAAAC GGGGAAAAGA AATATATTGT CGGGTTTAAA CAGACAATGA 
- 6CACGATGAG CGCCGCTAAG AAGAAAGATG TCATTTCTGA AAAAGGCGGG AAA6TGCAAA 
AOCAATTCAA ATATOTAGAC OCAOCTTCAG TCACATTAAA CGAAAAAQCT STAAAAGAAT 
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TGAAAAMGA 


OCCQAGCGTC 


GCTTACGTTG 


AAGAAGATCA 


CGTAGCACAT 


GCGTAC6CGC 


420 




AGTCCaTGCC 


TTACGGCGTA 


TCACAAATTA 


AAGCCCCTGC 


TCTGCACTCT 


CAAGGCTACA 


4B0 


5 


CTGGATCAAA 


TGTTAAAGTA 


GCGGTTATCG 


ACAGCGGTAT 


CGATTCTTCT 


CATCCTGATT 


540 




TAAAGGTAGC 


AAGCGGAGCC 


AGCATGGTTC 


CTTCTGAAAC 


AAATCCTTTC 


CAAGACAACA 


600 




ACTCTCACGG 


AACTCACGTT 


GCCGGCACAG 


TTGCGGCTCT 


TAATAACTCA 


ATCGGTOTAT 


660 


10 


TACGCGTTGC 


GCCAAOCGCA 


TCACTTTACG 


CTGTAAAAGT 


TCTCGGTGCT 


GACGGTTCCG 


720 


GCCAATACAG 


CTGGATCATT 


AACGGAATCG 


AGTGGGCGAT 


COCAAACAAT 


ATGGACGTTA 


^ao 




TTAACATGAG 


CCTCGGCGGA 


CCTTCTGGTT 


CTGCTGCTTT 


AAAAGCGGCA 


GTTGATAAAG 


340 




CCGTTGCATC 


CGGCGTCGTA 


GTCGTTGCGG 


CAGCCGGTAA 


CGAAGGCACT 


TCCGGCAGCT 


900 


IS 


CAAGCACAGT 


GGGCTACCCT 


G6TAAATACC 


CTTCTGTCAT 


TGCAGTAGGC 


GCTGTTGACA 


960 




GCAGCAACCA 


AAGACCATCT 


TTCTCAAOCG 


TAGQACCTGA 


GCTTGATGTC 


h.TGGCUCCTG 


1020 




GCGTATCTAT 


CCAAAGCACG 


CTTCCTGGAA 


ACAAATACGG 


GGCGTACAAC 


GGTACGTCAA 


1080 




TGGCATCTCC 


GCACGTTGCC 


GGAGCGGCTG 


CTTTGATTCT 


TTCTAAGCAC 


CCGAACTGGA 


1140 


20 


CAAACACTCA 


AGTCCGCAGC 


AGTTTAGAAA 


ACACCACTAC 


AAAACTTGGT 


GATTCTTTGT 


1200 




ACTATGGAAA 


AGGGCTGATC 


AACGTACAAG 


CGGCAGCTCA 


GTAAAACATA 


AAAAACCGGC 


1260 




CTTCGCCCCG 


CCGOTTTTTT 


ATTATTTTTC 


TTCCTCCGCA 


TGrrCAATCC 


GCTCCATAAT 


1320 




CGACGGATGG 


CTCCCTCTGA 


AAATTTTAAC 


GAGAAACGGC 


GGGTTGACCC 


GGCTCAGTCC 


1380 


25 


CGTAACGGCC 


AACTCCTGAA 


ACOTCTCAAT 


CGCCGCTFCC 


CGGTTTCCGG 


TCAGCTCAAr 


lAAO 




GCCATAACGG 


TCGGCGGCGT 


TTTCCTGATA 


CCGGGAGACG 


GCATTCGTAA 


TCGGATC 


1497 



<2) INFORMATION FOR SEQ ID N0:7t 

30 (i) CHARACTERISTICS: 

{A) LENGTH: 275 amino acids 

{B) TYPE: amino acid 

[C) STRANDEDNESS ; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

35 

ixLy SEQUENCE DESCRIPTION: SEQ ID N0:7: 

Ala Gin Ser Val Pro Tyi Gly Val Ser Gin lie Lys Ala Pro Ala Leu 
15 10 15 

His Smr Gin Gly Tyr Thr Gly Ser Asn Val Lys Val Ala Val He Asp 
20 25 30 

Sar Gly He Asp Ser Ser His Pro Asp Leu Lys Val Ala Gly Gly Ala 
35 40 45 

Ser Met Val Pro Ser 6lu Thr Asn Pro Phe Gin Asp Asn Asn Ser His 
50 55 60 

45 

Gly Thr His Val Ala Gly Thr Val Ala Ala Leu Asn Asn Ser He Gly 
65 70 75 BO 

Val Leu Gly Val Ala Pro Ser Ala Ser Leu Tyr Ala Val Lys Val Leu 
65 $0 95 

Gly Ala Asp Gly Ser Gly Cln Tyr Ser Trp II© He Asn Gly He Glu 
100 105 110 



55 
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Trp Ala lln Ala Asn Asn net Asp V&L lie Asn Het 5er Leu Gly Gly 
115 120 125 

Pro Ser Gly Ser Ala Ala Leu Lys Ala Ala Val Asp Lys Ala Val Ala 
130 135 140 

Ser Gly Val Val Val Val Ala Ala Ala Gly Asn Glu Gly Thr Ser Gly 
145 150 155 160 

Ser Ser Ser Thr Val Gly "Tyr Pro Gly Lyfi Tyr Pro Ser Val lie Ala 
165 170 175 

Val Gly Ala Val Asp Sez- Sax- Asn Gin Arg Ala Ser Phe Sex- Ser Val 
180 IBS 190 

Gly Pro Glu Leu Asp Val Met Ala Fro Gly Val Ser lie Gin Ser Thr 
195 200 205 

Leu Pro Gly Asn Lya Tyr Gly Ala Tyr Asn Gly Thr Ser Met Ala Ser 
210 215 220 

Fro His Val Ala Gly Ala Ala Ala Leu He Leu Ser Lys His Pro Asn 
225 230 235 240 

Trp Thr Asn Thr Gin Val Arg Ser Ser Leu Glu Asn Thr Thr Thr Lys 
245 250 255 

Leu Gly Asp Ser Phe Tyr Tyr Gly Lys Gly Leu lie Asn Val Gin Ala 
260 265 270 

Ala Ala Gin 
275 

IHFORIIATION FOR SEQ ID NOiS: 

(i) SEQUENCE CHARACTERISTICS: 

(A} LENGTH: 275 amino acids 
(B^ TYPE: amino acid 
(C) STRANDEDKE5S : single 
{D\ TOPOLOGY: linear 

(ii) MOLECULE TYPE! protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Ala Gin Ser Val Pro Tyr Gly lie Ser Gin He Lys Ala Pro Ala Leu 
1 5 ID . 15 

His Ser Glii Gly Tyr Thr Gly Ser Asn Val Lys Val Ala Val lie Asp 
20 25 30 

Ser Gly He Asp Ser Ser Hie Pro Asp Leu Asn Val Arg Gly Gly Ala 
35 40 45 

Ser Phe Val Pro Ser Glu Thr Asn Pro Tyr Gin Asp Gly Ser Ser His 
50 55 60 

Gly Thr His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly 
65 70 75 80 

Val Leu Gly Val Ser Pro Ser Ala Ser Leu Tyr Ala Val Lys Val Leu 
85 90 95 

AsD Ser Thr Gly Ser Gly Gin Tyr Ser Trp He He Asn Gly He Glu 
100 105 110 

Trp Ala He Ser Asn Asn Met Aep Val He Asn Met Ser Leu Gly Gly 
115 120 125 

Pro Thr Gly Ser Thr Ala Leu Lys Thr Val Val Aep Lys Ala Val Ser 
130 135 140 
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Ser Gly lie Val Val Ala Ala Ala Ala Oly Asn Glu Gly Sar Ser Gly 
145 ISO 155 160 

Ser Thr Ser Thr Val Gly Tyr Pro Ala Lys Tyr Pro Ser Thr lie Ala 
165 170 175 

Val Gly Ala Val Asn Ser Ser Asn Gin Arg Ala Ser Fhe Ser Ser Ala 
IBO IBS 190 

Gly Ser Glu Leu Asp Val Met Ala Pro Gly Val Ser He Gin Ser Thr 
195 200 205 

Leu Pro Gly Gly Thr Tyr Gly Ala Tyr Asn Gly Thr Ser Met Ala Thr 
210 215 220 

Pro His Val Ala Gly Ala Ala Ala Leu lie Leu Ser Lye His Pro Thr 
225 230 235 240 

Trp Thr Asn Ala Gin Val Arg Asp Arg Leu Glu Ser Thr Ala Thr Tyr 

245 250 255 

Leu Gly A$n Ser Phe Tyr Tyr Gly Lys Gly Leu He Asn Val Gin Ala 
260 265 270 

Ala Ala Gin 
275 

INTORMATION FOR SEQ ID NO; 9: 

(i) SEGUEnCE CHARACTERISTICS: 

(A> LENGTH: 274 amino acidfi 
(B> TYPE: amino acid 
<C) STSANDBPNESS: sinflfle 
iD) TOPOLOGY: linear 

(11] MOLECULE TYPE: protein 



(xi] SEQUENCE DESCRIPTION J SEQ ID NO: 9: 

Ala Cln Thr Val Pro Tyr Gly lie Pro Leu lie Lya Ala Asp Lys Val 
15 10 15 

Gin Ala Gin Gly Fhe Ly* Gly Ala Asn Val Lys V«l Ala Val L«u Asp 

20 25 30 

Thr Gly lie Gin Ala Ser Hia Pre Aap Leu Aan Val Val Gly Gly Ala 
35 40 45 

Ser Phe Val Ala Gly Glu Ala Tyr ASH Thr Asp Gly Asn Gly His Gly 
50 55 60 

Thr Kis Val Ala Gly Thr Val Ala Ala Leu Asp Asn Thr Thr Sly Val 
65 70 75 SO 

Leu Gly Val Ala Pro Ser Val Sar Leu Tyr Ala Val Lys Val Leu Asn 
85 90 9S 

Ser ser Gly Ser Gly ser Tyr Ser Gly lie Val Ser Gly lie Glu Trp 
100 105 110 

Ala Thx Thr Asn Gly Met Asp Val He Asn »et Ser Leu Gly Gly Ala 
115 120 125 

Ser Gly Ser Thr Ala Met Lys Gin Ala Val Asp Asn Ala Tyr Ala Arg 
130 135 140 

Gly Val Val Val Val Ala Ala Ala Gly Asn Ser Gly Asn Ser Gly Ser 
145 ISO 155 160 

Thr A&n Thr He Gly Tyr Pro Ala Lys Tyr Asp Ser Val He Ala Val 
165 170 175 
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Gly Ala Val hsp Ser Asn Ser Asn Arg Ala Ser Phe Ser Ser Val Gly 
ISO IBB 190 

Ala Glu Leu Glu V&l Met Ala Pro Gly Ala Gly Val Tyr Ser Thr Tyr 
195 200 205 

Pro Thr Asn Thr Tyr Ala Thr Leu Asn Gly Thr Ser Met Ala ser Pro 
210 215 220 

His Val Ala Gly Ala Ala Ala Leu xle Leu Ser Lys His Pro Asn Leu 
225 23Q 235 240 

Ser Ala Ser Gin Val Arg Asn Arg Leu Ser Ser Thr Ala Thr Tyr Leu 
245 250 255 

Gly Ser Ser Phe Tyr Tyr Gly Lys Gly Leu lie Asn Val Glu Ala Ala 
260 265 270 

Ala Gin 

INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUEKCE CHAEUVCTERI STICS J 

(A) LENGTH; 269 ajttino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE; protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 10: 

Ala Gin Ser Val Pro Trp Gly Xle Ser Arg Val Gin Ala Pro Ala Ala 
15 10 15 

His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 

20 25 30 

Thr Gly lie Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 

Fhe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 
85 90 95 

Ser Gly Ser Gly Ser Val Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
IDO 105 110 

Gly Asn Asn Gly Het: Kis Val Ala Asn Leu Ser Leu Gly Ser Pro ser 
115 120 125 

Pro Ser Ala Thr Leu Olu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
130 135 140 

Val Leu Val Va.1 Ala Ala Ser Gly Aen Ser Gly Ala Gly Ser He Sor 
145 150 155 160 

Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 
165 170 17 5 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp He 
ISO 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 
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Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 315 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin He 
225 230 235 240 

Ax-g Asn His L«u Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 
245 250 255 

Tyr Oly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Aro 
260 2€5 

(2) INFORMATION FOR SEQ ID MO;ll{ 

(i) SEQUENCE CHARACTERISTICS: 

(A) LSHGTH: 1X40 base pairs 
{B) TYPB; nucleic acid 
CO STRAND£msss: single 
CD) TOPOLOGY: linear 

Ui) MOLECULE TVPE; DNA (gencmic) 



ixi) SEQUENCE DESCRIPTION: SEQ ID HO: 11: 



ATOAAGAAAC 


CGTT6GGGAA 


AATTGTCGCA 


AGCACCGCAC 


rACTCATTTC 


TGTTGCTTTT 


60 


AOTTCATCGA 


TCGCATCGGC 


TGCTGAAGAA 


GCAAAAGAAA 


AA7ATTTAAT 


TGQCTTTAAT 


120 


GAGCAGGAAG 


CTGTCAGTGA 


GTTTGTAGAA 


CAAOTAGAGG 


CAAATGACOA 


GGTCGCCATT 


lao 


CTCTCTGAGG 


AAGAGGAAGT 


CGAAATT6AA 


TTGCTTCATG 


AATTTGAAAC 


GATTCCTGTT 


240 


TTATCCGTTG 


AGTTAAGOCC 


AGAAGATGTG 


GACGCGCTTG 


AACTCGA7CC 


AGCGATTTCT 


300 


TATATTGAAG 


AGGATGCAGA 


AGTAACQACA 


ATGGCGCAAT 


CAGTGCCATG 


GGGAATTAGC 


360 


CGTCTGCAAG 


CCCCAGCTOC 


CCATAACC6T 


GGATTGACAG 


GTTCTGGTGT 


AAAAGTTGCT 


420 


GTCCTCGATA 


CAGGTATTTC 


CACTCATCCA 


GACTTAAATA 


TTCGTGGTGG 


CGCTAGCTTT 


480 


GTACCAGGGG 


AACCATCCAC 


TCAAGATGGG 


AATGGGCATG 


GCACGCATGT 


GGCCGGGACG 


540 


ATTGCTOCTT 


TAAACAATTC 


GATTGGCGTT 


CTTGGGGTAG 


CaCCGAGCGC 


GGAACTATAC 


600 


GCTGTTAAAG 


TATTAGG6GC 


QAGCGGTTCA 


GGTTCGOTCA 


GCTCOATTGC 


CCAAGGATTG 


660 


GftATGGGCAG 


GGAACAATGG 


CATGCACGTT 


GCTAATTT6A 


GTTTAGGAAG 


CCCTTCGCCA 


720 


AGTGCCACAC 


TIGAGCAAGC 


TGTTAATAGC 


GCGACTTCTA 


GAGGCGTTCT 


TGTTGTAGCG 


7S0 


GCATCTGGGA 


ATTCAGGTGC 


AGGCTCAATC 


AGCTATCCGG 


CCCGTTATGC 


GAACGCAATG 


340 


GCAGTCGGAO 


CTACTGACCA 


AAACAACAAC 


CGCGCCAGCT 


TTTCACAGTA 


TGQCQCAGGG 


900 


CTTGACATTG 


TCGCACCAGG 


TGTAAACGTG 


CAGAGCACAT 


ACCCAGGTTC 


AAC6TATGCC 


960 


AGCTTAAACG 


GTACATCGAT 


GGCTACTCCT 


CATGTTGCAG 


GTCCAGCAGC 


CCTTGTTAAA 


1020 


CAAAAGAACC 


CATCTTGGTC 


CAATGTACAA 


ATCCGCAATC 


ATCTAAAGAA 


TACGGCAACG 


1080 


AGCTTAGGAA 


GCACGAACTT 


GTATGGAAGC 


GGACTTGTCA 


ATOCAGAAGC 


GGCAACACGC 


1140 



(2) INFORMATIOM FOR SEQ ID NOtIZj 

(i) SSQUEMCE CHARACTERISTICS : 

(A) LENGTH: 1140 base pairs 

(B) TYPE: nucleic acid 

(C) STRAWDEDNESS : single 
{D) TOPOLOGY: linear 

(ii) HOLECULS TYPE: CKA (genomic) 
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10 



IS 



20 



25 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

ATCSAAGAAAC CGTTGGGGAA AATTGTCGCA AGCACCGCAC TACTCATTTC TGTTGCTTTT 60 

AOTTCA7CGA TCGCATCGGC TGCTGAAGAA GCAAAAQAAA AATATTTAAT TGGCTTTAAT 120 

GAGCAGGAAG CTGTCAGTGA GTTTGTAGAA CAAGTAGAGG CAAATGACGA GGTCGCCATT 180 

CTCTCTGAGG AAGAGGAAGT COAAATTGAA TTGCTTCATG AATTTGAAAC GATTCCTGTT 240 

TTATCCOTTO AQTTAAQCCC AGAAflATOTQ GACGCGCTTG AACTCGATCC AGCGATTTCT 300 

TATATTGAAG AGGATGCAGA AGTAACGACA AO^GGCGCAAT CAGTGCCATG GGGAATTAGC 3€0 

COrGTGCAAG CCCCAGCTGC CCATAACCGT GGATTGACAG GTTCTGGTGT AAAAGTTGCT 420 

GTCCTCGATA CAGGTATTTC CACTCATCCA GACTTAAATA TTCGTGGTGG CGCTAGCTTT 480 

GTACCAGGGG AACCATCCAC TCAAGATGGO AATGGGCATG GCACGCATGT GGCCGGGACC 540 

ATTGCTGCTT TAGACAACTC C5ATTGGCOTT CTTGGCGTAG CGCCGAGCGC C3GAACTATAC 600 

GCTGTTAAAG TATTAGG6GC GAGCGGTTCA GGCGCCATCA GCTCGATTGC CCAAGCAT^G 660 

GAATGGGCAG GGAACAATGG CATGCACGTT GCTAATTTGA GTTTAGGAAG CCCTTCGCCA 720 

AGTGCCACAC TTGAGCAAOC TGTTAATAGC GCGACTTCTA GAGGCGTTCT TGTTGTACCG 780 

GCATCTGGGA ATTCAGGTGC AGGCTCAATC AGCTATCCGG CCCGTTA*rGC GAACGCAATG B40 

GCAC3TCGQAG CTACTGACCA AAACAACAAC CGCGCCAGCT TTTCACAOTA TQQCCCACCG 900 

CTTGACATTG TCGCACCAGC TGTAAACQTG CAGAOCAGAT ACCCAOGTTC AACGTATGCC 960 

AGCTTAAACG GTACATCGAT GGCTACTCCT CATGTTGCAG GTOCACCAGC CCTTGTTAAA 1020 

CAAAAGAACC CATCTTGGTC CAATGTACAA ATCCGCAATC ATCTAAAQAA TACGGCAACG 1080 

AGCTTAGGAA GCACGAACTT GTATGGAAGC GGACTTOTCA ATGCAGAAGC GGCAACACQC 1140 
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(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHAKACTERISTICS : 

(A) LENGTH; 30 base pairs 

(B) TVPE: nucleic acid 
<C} 5TRANDEDNBSS J single 
it» TOPOLOGY; linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

TATGCCAGCC ACAACGGTAC TTCGATGGCT 
(2) INFORMATION FOR SEQ ID NOfl4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPEt nucleic acid 
{C} STRAl^EDNESS t single 
(D) TOPOI^Y; linear 

MOLECULE TYPE: DMA (genomic) 
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l3ci) SEQUENCE DESCRIPTION: SEQ ID HO: 14: 
CACAOTTGCG GCTCTAGATA ACTCAATCGG T 
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<2} XNFORMATZOK FOR SEQ JD NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 
(A} LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRAN0EDHE55: single 

(D) TOPOLOGY t linear 

(ii) MOLECULE TYPE: DMA <ge&omic) 



(xi> SEQUENCE DESCRIPTION; SEQ ID NO: 15: 
GCTGACGGTT CCGGCGCTAT TAGTTGGATC ATT 33 
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Claims 

20 1. Acarbonyl hydrolase variant having an amino acid sequence not found in nature derived from a precursor carbonyl 
hydroiase comprising a substitution of a different amino acid for a piuraiity of amino acid residues at a position in 
said precursor carbonyl hydrolase equivalent to +76 In Bacillus amyloliquefaciens subtlllsin, In combination with 
one or more amino acid residue posrtlons equivalent to those selected from the group consisting of -i-99, +101 , 
+103, +104, +107, +123, +27, +105, +109, +126, +128, +135, +156, +166, +195, +197, +204, +206, +210, +216, 

25 +217, +218, +222, +260, +265 and/or +274 in Sac///L/s amy/o//qfL/efec/e/issubtllisin. 

2. A variant according to Claim 1 wherein the precursor carbonyl hydrolase is a subtllisln. 

3. A subtllisln variant according to Claim 2 wherein a combination of substitutions is made at the positions equivalent 
so to 76/99, 76/101, 76/103, 76/104, 76/107, 76/123, 76/99/101, 76/99/103, 76/99/104, 76/101/103, 76/101/104, 

76/1 03/1 04, 76/1 04/1 07, 76/1 04/123, 76/1 07/1 23, 76/99/1 01/1 03, 76/99/1 01/1 04, 76/99/1 03/1 04, 76/1 01/1 03/1 04, 
76/1 03/1 04/1 23, 76/1 04/1 07/1 23, 76/99/1 01/1 03/1 04, 76/99/1 03/1 04/1 23, 76/99/1 01/1 03/1 04/123, 
76/1 03/1 04/1 26, 76/1 03/1 04/1 35, 76/1 03/1 04/1 97, 76/1 03/1 04/222, 75/1 03/1 04/260, 76/1 03/1 04/265, 
76/1 03/1 04/1 26/265, 27/76/1 04/1 23/274, 27/76/1 04/1 09/1 23/274, 27/76/1 04/1 23/21 8/274, 27/76/1 04/1 23, 

35 27/76/104/107/123, 27/76/104/109/123, 27/76/104/109/123/218/274, 27/76/104/123/197, 27/76/104/123/204, 

27/76/1 04/1 23/206, 27/76/1 04/1 23/21 6, 27/76/1 04/1 23/21 8, 27/76/1 04/1 23/260, 27/76/1 04/1 23/1 95/1 97, 
27/76/1 04/1 23/1 95/21 8, 27/76/1 04/1 23/1 97/21 8, 27/76/1 04/1 23/204/21 8, 27/76/1 04/1 23/206/21 8, 
27/76/1 04/1 23/21 8/260, 27/76/1 04/1 23/1 95/1 97/21 8, 76/1 03/1 04/21 7, 76/1 03/1 04/1 56, 76/1 03/1 04/1 66, 
76/103/104/105, 76/101/103/104, 76/103/104/128, 76/103/104/210, 76/103/104/107, 76/103/104/204, 76/217, 

40 76/1 03/1 04/1 56/1 66 and 76/1 03/1 04/1 28. 

4. A subtllisln variant according to Claim 3 wherein a combination of substitutions is made at the positions equivalent 
to 76/99, 76/101, 76/103, 76/104, 76/107, 76/123, 76/99/101, 76/99/103, 76/99/104, 76/101/103, 76/101/104, 
76/1 03/1 04, 76/1 04/1 07, 76/1 04/1 23. 76/1 07/1 23. 76/99/1 01/1 03, 76/99/1 01 /1 04. 76/99/1 03/1 04. 76/1 01/1 03/1 04. 

45 76/1 03/1 04/1 23, 76/1 04/1 07/1 23, 76/99/1 01/1 03/1 04, 76/99/1 03/1 04/1 23; 76/99/1 01/1 03/1 04/1 23; 

76/1 03/1 04/1 28; 76/1 03/1 04/260; 76/1 03/1 04/265; 76/1 03/1 04/1 97; 76/1 03/1 04/1 05; 76/1 03/1 04/1 35; 
76/1 03/1 04/1 26; 76/1 03/1 04/1 07; 76/1 03/1 04/21 0; 76/1 03/1 04/1 26/265 and/or 76/1 03/1 04/222. 

5. A subtllisln variant according to Claim 4selected from the group consisting of 76/99, 76/1 04, 76/99/1 04, 76/1 03/1 04, 
50 76/1 04/1 07, 76/1 01/1 03/1 04. 76/99/1 01/1 03/1 04 and 76/1 01/1 04. 

6. Asubtliisin variantaccordingtoCiaim4whereinthesubtlllsinvarlantscomprise: N76D/S99D; N76D/S101I=S; N76D/ 

S103A: N76D/V104i; N76D/I107V; N76D/N123S; N76D/S99D/S1 01 R; N76D/S99D/S1 03A; N76D/S99D/V1 041; 
N76D/S101R/S103A; N76D/S1 01 R/V1 04i; N76D/S1 03/W1 041; N76D/V1 041/11 07V; N76D/V1 04Y/i1 07V; N76D/ 
55 V104i/N123S; N76D/i1 07V/N1 23S; N76D/S99D/S1 01 R/S1 03A; N76D/S99D/S1 01 R/V1 04i; N76D/S99D/S1 03A/ 

V104I; N76D/S101R/S103/W104i; N76D/S1 03/W1 04I/N123S; N76D/V1 041/11 07V/N123S; N76D/S99D/S1 01 R/ 
S103AA/104i; N76D/S99D/S103A/V1 04i/N123S; N76D/S99D/S1 01 R/S103AA/104i/N123S; N76D/S103AA/1 041/ 
S128G; N76D/S103A/V104i/T260P; N76D/S103/W104i/S265N; N76D/S1 03AA/1 04i/D1 97E; N76D/S103/V 
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V104I/S105A; N76D/S103AA/1 04I/L135I; N76D/S103AA/f104l/L126F; N76D/S1 03AA/104T/L107T; N76D/S103A/ 
V104I/P210I; N76D/S103AA/104I/L126F/S265N and N76D/S1 03A/V1 04I/M222A. 

7. A subtilisin variant of Claim 5 wlierein the subtilisin variants comprise N76D/S99D, N76D/V104I, N76D/S99D/ 
V104I, N76D/S103AA/104I, N76D/V1 041/11 07V, N 76 D/ VI 047/11 07V, N76D/S101 R/S1 03AA/104I, N76D/S99D/ 
S101 R/S103AA/104I and N76D/S1 01 R/V104I. 

8. A subtilisin variant according to Claim 2 which Is derived from a Bacillus subtilisin. 

9. A subtilisin variant according to Claim 8 which is derived from Bacillus lentus subtilisin. 

10. DNA encoding a carbonyl hydrolase variant of Claim 1 . 

11. Expression vector encoding the DNA of Claim 10. 

12. Host cell transformed with the expression vector of Claim 11 , 
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CONSERVED RESIDUES IN SUBTILISINS FROM 
BACILLUS AMYLOUQUEFACiENS 

1 10 20 
AQSVF.G APA.H. .0 



21 30 
. TGS .VKVAV. 



41 

-D It . . . G G A S 

61 70 
, N . HGTHVAGT 

81 90 
VIjGVAPSA . LY 

101 110 
S G . . S . L . . G . 

121 130 
V . N . S Ij G . P S . 



141 

G V . 



40 

D . G . . . .HP 



60 
Q D 

80 

.AALNNSIG 

100 

AVKVLOA . G 

120 

E W A . N . . . . 

140 

3 # • • • » ■ * 

160 

A . G N . G . . . 

180 

. . . . A V G A . 

200 

.G..I1D. .A 

220 

. . y . . . N G T 

240 

A L . . . K . . . 

260 

NT. . . L 6 . . 



50 

. V P 



150 
V V A 



161 



17 0 
Y P . . Y 



181 190 
D . . N . .ASPS. 



201 
P G V 



210 
Q S T . P 6 



221 



241 



230 



SUA . PHVAGAA 



250 



W . . . Q . R . . It . 



261 



270 



. . y G . G L . N . . 
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